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On Delta Modulation 


By DAVID SLEPIAN 


(Manuscript received May 3, 1972) 


We show how the steady-state distribution and the mean squared error 
of a delta modulator with an ideal integrator can be computed exactly 
when the input signal to the modulator ts a stationary Gaussian process 
with a rational power spectral density. Curves are presented for the mean 
squared error as a function of the quantizer step stze and the sampling 
interval for several different input spectra. The mathematical development 
makes use of the Alarkov properties of the system and involves series ex- 
expansions in n-dimensional Hermite functions. The key integral equation 
as generalized to treat the case of a realizable filter in the feedback path, but 
an analytic method of solving this equation has not been found. 


I. INTRODUCTION 


Demand for the transmission of digital data grows apace as the 
computerization of our society continues. This demand, coupled with 
the many recent striking advances in solid state circuit technology and 
with new concepts of digital switching, assures an increased role for 
digital transmission systems in the near future. The existence of such 
systems in turn gives new importance to digital means of transmitting 
analog signals. This paper is concerned with one such means—delta 
modulation. 


2101 
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In its simplest form, as depicted in Fig. 1, the delta modulation 
transmitter approximates a continuous input signal X(t) by a staircase 
signal Z(t) that has treads of duration 7 and risers of height A. Every 
T seconds the staircase either rises one step or falls one step in order 
to approach X(t) at that instant more closely. At each rise or fall, 
the delta modulator emits a binary digit that specifies the direction 
of the step just taken. At the receiver, these transmitted binary digits 
are then used to reconstruct Z(t), or perhaps a smoothed version of it. 

This system was first described in the literature in 1952.’ Because of its 
extreme conceptual simplicity, and its relative ease of instrumentation, 
delta modulation has attracted the attention of theorists and experi- 
mentalists alike, and many studies of it and its generalizations have been 
undertaken in the ensuring years. Many of these have been concerned 
with calculation or measurement of the mean squared error suffered by 
signals transmitted by delta modulation and with determination of 
how this quantity varies with the parameters of the system. Almost 
without exception, the theoretical studies are based on approximations, 
the range of validity of which is difficult to determine. 

The present paper is also concerned with the mean squared error 
inherent in delta modulation. Our attention is focused on stationary 
Gaussian input ensembles X (¢) that have rational power density spectra. 
For this class of inputs we show that the mean squared error can indeed 
be computed exactly for the simple modulator of Fig. 1. 

Since the mathematical analysis entailed tends to become quite 
involved, we have organized the paper into three main parts. Section II 
presents definitions, discussion and the results of numerical work. It is 
free of laborious mathematical derivations and is intended for the 


AMPLITUDE X/A 





TIME t/T 


Fig. 1—The waveforms of a simple delta modulator with ideal integrator. 
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casual reader. In Section III a detailed mathematical treatment leading 
to a means for computing the mean squared error is given along with 
some necessary additional theory. Section IV describes a generalization 
of the present study to systems with realizable filters in the feedback 
loop. 


II. DEFINITIONS, DISCUSSION AND RESULTS 


2.1 Some Definitions and Descriptions 


The modulator described in the Introduction can be defined with 
mathematical precision as follows. A signal X(¢) is given for t = 0. 
Also given are a sampling period T > 0, a step size A > O, and an 
initial value h. Numbers Z; , j = 0, 1, 2, --- are defined recursively by 


Le = hh; 
zZ, = iS +A, XGT) > Zi 
Z;-. — A, AGT) SA 3 
The delta modulation approximation signal is then given by 
4)=2Z2,, j7 St<G+DT, 7 =0,1,2,---. ©) 


Notice that Z(é) can only take on values from the set 8 = {--- h — 2A, 
h—A,h,h + A, h + 2A, ---}. Indeed, the allowed values of Z(¢) are 
restricted in a periodic way. If ¢ lies in an even interval, i.e., if 2n7 S 
t< (Qn + 1)T for some n = 0, 1, 2, --- , then Z(¢) must take a value 
from the set 


8 = {---h—4A,h — 2A,h,h + 2A, h + 4A, ---}. (3) 


If ¢ lies in an odd interval, i.e. if Qn + 17 St < Qn + 2)T for some 
n = 0,1, 2, --- , then Z(t) must take a value from the set 


8S ={---h—-—3A,h-A, h+A,h + 34, ---}. (4) 


gS 21; 2, By PF? 5 (1) 


Due to the non-linear nature of (1), it is very difficult to say much 
about how well Z(¢) approximates any given signal X(¢), nor is this 
question of any real importance in a communication setting. What 
matters is how well Z(t) does on the average in approximating the 
members of an ensemble of functions that represents an analog informa- 
tion source. Thus we are led to consider the delta modulator described 
by (1) and (2) when X(¢) is a sample function of a stochastic process. 
Throughout the paper we shall restrict our attention to the case in 
which X(¢) is stationary and satisfies the conditions 
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EX(t) =0, EX?() =1. (5) 


The latter constraint sets the scale by which signal power is measured. 

With X(t) stochastic, Z(t) becomes a dependent stochastic process, 
and we can speak of the joint distribution of X(t) and Z(t) at any set 
of times, O St, < 2 --- < t,. Even when X(f) is a stationary process, 
Z(t) will not in general be stationary, and the joint distribution of X(t) 
and Z(t) at times j7 + ¢,,j77 + t., --- ,j7 + #, will depend on the 
integer 7. Real world delta modulators, however, ‘‘settle down’’, and 
hence one would expect the distribution just referred to to approach 
a limit as 7 — ©. Unfortunately, there are some subtleties to this notion 
due to the periodic nature of the allowed values of X(t), as already 
mentioned. Under suitable regularity assumptions, one limiting dis- 
tribution will be approached as 7 — o through even values 7 = 2m, 
m = 0, 1, 2, --- ; another will be obtained as j — © through odd values, 
j = 2m + 1,m = 0,1, 2, --- . We call the average of these two limit 
distributions ‘‘the steady-state distribution.”’ It describes the settled 
down behavior of the delta modulator. The marginal distribution of 
X(t) computed from this steady-state distribution is, of course, still 
the original given distribution for X(Z). 

The conditions under which the statistics of delta modulators approach 
limiting forms as just described have been investigated by Gersho.* 
His work shows that for the cases treated in this paper, the limits 
referred to above exist, and that the density of interest here is given 
by the unique normalized solution of our key equation (22). 

We now measure the accuracy of the delta modulator by the mean 
squared error 


é=(A,D=E 7 | “(XW — ZY at 


where X(t) and Z(t) have the steady-state distribution and H denotes 
expectation. Our main interest is on how € varies with 7, A, and the 
statistics of X(t). 

Delta modulation is frequently described by passing reference to a 
block diagram such as is shown in Tig. 2. (The box labelled ‘“‘filter’”’ 
is called a “perfect integrator’ for the case at hand.) On the surface, 
this appears to be much more succinct than (1) and (2) and the subse- 
quent limit discussions. Figure 2 describes a recursive situation, however, 
and so fails to define anything at all unless supplemented with side 
information that either permits the recursion to be started, or serves 
otherwise to define a joint distribution for X(é) and Z(t). Analyses of 
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delta modulation based on Tig. 2 with unstated initial conditions, and 
no limiting arguments are apt to be approximate. 


2.2 Some Heuristics and History 


Let us consider ¢ as a function of A for a fixed sampling period 7 and 
for fixed-input ensemble statistics satisfying (5). If A is extremely 
large compared to unity, then for the most part of its history Z(d) will 
alternate between the level h and one of the two levels h + Aorh — A. 
Thus one expects the asymptotic result 


ée (A, T) ~ 14’ 


as A — o. On the other hand, if A is very small compared to unity, 
Z(t) will rarely wander far from its initial value h and one expects 
the result 


lim &(A, T) = E[LX() —h?P = 14+ 2’. 
A-0 


(The rate at which ¢& > 1 + h* as A— 0 isa more subtle question that 
requires detailed analysis.) Thus the curve of ¢(A, 7) vs A starts 
at ¢ = 1+ A and ultimately rises like }A*. How does it behave in 
between? Does it always dip yielding a best value for A, i.e., a positive 
value for which e’ is least? 

There have been many analyses of delta modulation in the past. 
The few listed here,’’®'’ provide entry to the literature. Many of them 
predict the existence of a best A > 0 for any 7. Their analysis is based 
on the notion that the total error is the sum of two kinds of error- 
quantization error and slope-overload error. The delta modulation 
signal Z(t) can climb or fall at a maximum average rate of A/T = &, 
so that if | dX /dt | exceeds & for a length of time much greater than 7, 
a serious tracking error will occur. Such a “region of slope overload” 
is seen in Fig. 1 for 8 S$ t/T S 11. In the region 0 S ¢/T S 8 of Fig. 1, 
| dX/dt| < € and the error here is classified as ‘quantization error’. 


QUANTIZER 
r-A 


X(t) SAMPLER 
PERIOD T 





FILTER 


Fig. 2—Block diagram of delta modulator with general feedback filter. 


2106 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1972 


This notion of two sorts of error has been very fruitful, and approximate 
calculations based on it agree well with experiment when 7 is small 
compared to any natural period associated with X(t). This, of course, 
is the case of interest in practice. The calculations are based on many 
approximations, however, and it is difficult to determine their precise 
range of validity without recourse to experiment. 

Two notable exceptions to this approach to the mean squared error 
are the exact treatments by Fine’’ and Aaron and Stanley.’’ The former 
treats a time discrete model with the input process X(n7T),n = 0,1,---, 
restricted to have independent increments. Aaron and Stanley treat 
the case in which X(¢) is a binary random telegraph signal. Neither 
of these cases is applicable to the transmission of speech or to continuous 
amplitude television signals. The work by Aaron and Stanley, however, 
has much of the flavor of the present study and presents a one-dimen- 
sional version of our key integral equation (22). Closely related work 
is also to be found in the papers of Davisson** who presents an integral 
equation and suggests a solution in a series of Hermite functions. 


2.3 Results of Computations 


The method described later in this paper, in principle, permits exact 
calculation of ¢? whenever X(t) is a stationary Gaussian process with a 
rational power spectral density, 


Ile +0) 


Pa) = k= 
I (w” + di) 


(6) 


where m < n and w = 2rf is the angular frequency. The complexity 
of the computation grows rapidly with n and consequently we have 
done numerical work only for n = 1 and n = 2. The method involves 
series that unfortunately converge slowly for small 7’, so that we have 
not been able to explore the interesting region of very small T. 

Figure 3 shows plots of « vs A when the input process X(t) has 
spectrum 


2 


tte (7) 


Prc(w) = 


The corresponding covariance is 
pre(t) = BX(OX(t+ 7) =e". (8) 


We refer to this as the RC-noise case. 
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RC—NOISE SIGNAL, h=0 


Fig. 3—Curves of é vs A for delta modulator with RC Gaussian-noise input. 


On the curves of Tig. 3, a cross points out the optimal value of A, 
that is, the value A,i, (7) that minimizes ¢. As T' goes to zero, Amin 
decreases (slowly) and the corresponding error decreases rapidly. As 
T increases, however, A,jn reaches a maximum, then starts to decrease 
once more toward zero. Note that for large sampling times (7' > 2.2, say) 
the delta modulator performs poorly indeed. As far as mean squared 
error is concerned, at these rates one would do better by taking the 
constant zero as an approximation to the input than by using the 
delta modulation signal Z(2). 

Figures 4 and 5 show the somewhat similar results obtained for the 
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2(at+b) 
(w2 + a2)(w24+b2) 


 (o) = 





0 0.2 0.4 0.6 0.8 1.0 1.2 
A 
DAMPED RLC—NOISE SIGNAL, h = 0, a= 0.25, b= 4.0 


Fig. 4—Curves of 2 vs A for delta modulator with damped RLC Gaussian-noise 
input. 


input spectrum 


2(b 
Ppricw) = ES , ab = 1, (9) 
corresponding to the covariance 
1 -altl —blr! 
Poric(t) = ; ea [be — ae } (10) 


When a and 0b are real, we refer to the input with spectrum (9) as 
damped RLC noise. The spectrum in this case is unimodal with its 
maximum at the origin. 


When 
a=a+t ig, b=a — 7, (11) 
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with @ and @ real, (9) and (10) become 


4a ‘ 2 
Pperic(w) oa [wo = (B° = a’)]? a 4a’ B? } a + B — 1. (12) 





—alrl 


peruc(t) =~ 3 [asin 6 | 7 | + 8 cos 6]. (13) 





For this ‘resonant RIC noise’’ case, the spectrum (12) develops a large 
narrow peak at w = las a — 0. Figures 6 through 9 show the curious 
resonance phenomena that set in as a — 0 and the input signal becomes 
more and more sinusoidal in nature. For the limiting noise obtained 


2 (a+b) 
(w2+a2) (w2 +b2) 


® (w) = 


€2 





0 0.2 0.4 0.6 0.8 1.0 1.2 
A 
DAMPED RLC—NOISE SIGNAL, h=0, a=0.909, b= 1.100 


_ Fig. 5—Curves of 2 vs A for delta modulator with damped RLC Gaussian-noise 
input. 
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4a 
(w2+a2— 82)2 4402 B2 


(P(m) = 





0 0.2 0.4 0.6 0.8 1.0 1.2 
A . 
RESONANT RLC—NOISE SIGNAL, h=0, @ = 0.900, 8 =0.436 


_ Fig. 6—Curves of é vs A for delta modulator with resonant RLC Gaussian-noise 
input. 


when a — 0, the single frequency Gaussian ensemble, one can compute 
e by other methods and anomalous shapes for large 7 similar to those 
of Fig. 9 are found. 

In the range ¢ < 0.4, the curves of Figs. 3 through 9 agree roughly 
with values computed by O’Neal” and others. These comparisons can, 
at best, yield approximate agreement, since they are among systems 
differing in a number of assumptions including the spectral shape of 
the signal. The approximate methods, based on quantization noise and 
slope-overload noise will probably continue to be used in practice, as 
they are much simpler to use than the scheme given here. The present 
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curves do, however, provide exact values for comparison purposes, and 
this is perhaps the main practical contribution of this paper. 


2.4 Outline of Mathematical Argument 


In this section we outline briefly the mathematical argument of 
Section III, and point out some of the formulae used to obtain the 
numerical results of the preceding section. 

A stationary Gaussian process X(¢) with the rational power spectral 
density (6) can always be written as the first component of an n-vector 
Gaussian process 


4a 
(w2+a@2—B2)24+4a2 B2 





D(w)= 





0 0.2 0.4 0.6 0.8 1.0 1.2 
A 
RESONANT RLC—NOISE SIGNAL, h=0,«« =0.400, B =0.916 


_ Fig. 7—Curves of & vs A for delta modulator with resonant RLC Gaussian-noise 
input. 
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4a 
(w24a2 — B2)2444282 


P(w) = 





0) 0.2 0.4 0.6 0.8 1.0 1.2 
A 
RESONANT RLC—NOISE SIGNAL, h=0, a = 0.100, 6 =0.995 


Fig. 8—Curves of e vs A for delta modulator with resonant REC Gaussian-noise 
input-narrow band case. 


that is Markovian.” It is not difficult to see then that the n + 1 quan- 
tities X,(j7), XT), --- , X,GT), Z;-1 for 7 = 1, 2, --- , forma 
time-discrete vector Markov process. The first » components can take 
any real values, but the last component is restricted to alternate between 
values in the sets 8, and 8, of (8) and (4). The stationary measure, or 
what we call the steady-state distribution, m,(x), satisfies the Chapman— 
Kolmogorov equation (22) with the boundary conditions (23). The 
notation is explained below (23). 

The kernel pr(y | x) of (22) can be developed in a multiple power 
series in the cross-correlations 6;; defined in (31). This power series 
resembles Mehler’s formula and involves certain functions ¥;(x; a) of n 
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variables wt; , %,--: , “, that we call n-dimensional Hermite functions. 
They are defined in (24) and (39). The parameters a here are the corre- 
lations (26). The expansion of the kernel is given in (46) in a highly 
symbolic form. To understand this equation fully, Section 3.2 and the 
first paragraph of 3.8 must be read. 

The expansion (46), in turn, suggests the expansion (47) of the steady- 
state distribution. We write that symbolic equation in full here: 


m,(x) = > > aa > > ie > [Tr TL oc | 


¥11=0 y12=0 Ynn=0 11,=0 T,=0 j=1 k= 
T Fashawes peck hareite Vi cigcei A& a). 
Thus vis an n X 7 matrix of indices, and 1 is an n-vector of indices. 


1.6 


1.4 


1.0 


€2 


0.8 


0.6 


0.4 


4a 
(w2+a2—-B2)2 +4a2B2 


0.2 (w) = 








0 0.2 0.4 0.6 0.8 1.0 1.2 
A 
RESONANT RLC-NOISE SIGNAL, h=0, @ =0.050, 8=0.999 


_ Fig. 9—Curves of é vs A for delta modulator with resonant 2LC Gaussian-noise 
input—very narrow band case. 
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Substitution of (47) and (46) in the integral equation (22) yields the 
recurrence (52) for the expansion coefficients f;,;. This equation in- 
volves quantities p,;,, and qg;,, defined in (49) and (50). Section 3.4 
shows how these quantities can be computed recursively. 

The remainder of Section 3.3 is concerned with solving the recurrence 
(52). The quantities fio9 are given explicitly by (64) for each 7 = 0, 
+1, --- . Equation (74), with the definitions (70) and (73), gives foo. 
The remaining /’s are given by (75) and (52) when these are applied in 
proper sequence. 

With the expansion coefficients f;,,; known, in principle one can write 
down the joint steady-state distribution of X(t) and Z(t) at any number 
of times, and from this quantity derive many statistical properties of 
the delta modulator. Our interest here has centered only on the mean 
squared error e’. In Section 3.5, an expression for this quantity in terms 
of the steady-state distribution m;(x) is developed. The expansion (47) 
is then used along with properties of the n-dimensional Hermite func- 
tions to obtain a formula, (100), for ¢ involving only the expansion 
coefficients f;,, and other known quantities. From this formula, the 
values shown on Figs. 3 through 9 were obtained. 


Ill. MATHEMATICAL TREATMENT 


3.1 The Integral Equation 


Let X(t), the input to the delta modulator, be a continuous stationary 
stochastic process with mean zero, normalized to have variance unity 
as shown in (5). We introduce the following notation: 


a;=h+iA, 1=0,+1, +£2,--- (14) 

XS XGP); j= 0,1,2,-:- (15) 

p(y | a) dy = Pr{y Ss X@+7) <y t+ dy |X = 2} (16) 

m;” (y) dy = Pr fy $ X; < y+ dy, Z;-. = a;} (17) 
j= 1,2,+--+, «= 0, £1, #2, °-°. 


Thus pr(y | x) is the conditional probability density of one sample of 
the input given the preceding sample, and m{” (y) dy is the probability 
that Z(¢) has the value h + 7A just before the jth sampling instant and 
that the jth sample of the input, X;, lies in a small range about the 
value y. 

The event Z;_; = a; appearing on the right of (17) can occur in two 
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ways: either Z;_, has the value a;_, and X;_; > a;_,; or else Z;_, = @i4, 
and X;~, S a;4, . Thus (17) can be written 


ms? (y) dy = Pr {y < XxX; < y + dy, Dar > Q;-4 iD tee = Q;-1} 
ae-FP Ly = X; <—Y = dy, X m1 S Qi41 , Zi-2 = Gist}. 


= dy | mE OQ | 2, i = 1) ae 


+ dy f° mE? QQ |e, 7+ 1) ae (18) 


where 


Qiy| a, 1) dy = Pr{y SX; < y + dy | Xj = 4, Zj-2 = aj}. 
Now, if X(t) ts Markovian, 


Qi(y | 2,2) = pry | z) 
and (18) becomes 


my) = i ms" (pry | 2), da 


+f m5? (poly | 2) de. (19) 


The pair of processes X(t) and Z(t) then form a 2-component vector 
Markov process. One component, Z(é), takes discrete values; the other, 
X(t), takes continuous values. Equation (19) is the Chapman-Kolmo- 
gorov equation for this vector process. 

We have commented in Section II that m{?? (y) and m{??*” (y) will 
in general have different limiting forms as 7 ~ o. By replacing 7 by 
7 + 1 in (19) and adding the result to (19), one finds that 


my) = Ame? (y) + mi?) 


also satisfies (19). Taking the limit as 7 — «, we then have 


my) = i m;:—(x)pr(y | 2) dx + [ Misile)prly |x) dx (20) 


ir 


where 


m(y) = lim my” (y) 


is the steady-state joint distribution for X(¢) and Z(t). Equation (20) 
must be supplemented with the boundary condition 
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2 me) = ple) (21) 
where p(z) is the probability density for X(#). 

The foregoing generalizes readily to the case in which X(t) is not 
itself Markovian but is one component, say the first, of an n-component 
stationary-vector Markov process. Denote this process by X(t) = 
{X,(t) = X(), X2(f), --- , X,()}. We imagine a delta modulator 
generating approximations Z; to X,(j7) in the manner already described. 
With an obvious extension of our previous notation, we find 


mig) = Jo dey [des [derma aero |) 


+ | dt, ++: i de, f° doymen(paly | x), 


t= 0, +1, +2,:-:- (22) 

2 mix) = P@). (23) 

Here, of course, x and y are n-vectors, p,(y | x) is the conditional 

probability density of X(¢ + 7) given X(é), p(x) is the density of X(), 

and m;(y) is the steady-state distribution for X(é) and Z(t), the index 7 

referring to the value a; = h + 7A for Z(t). Equations (22) and (23) 
are the basic ones on which this paper is built. 

In all that follows, we restrict our consideration to inputs X(¢) that 
are Gaussian. It is well-known” that if, in this case, X(t) has a rational 
power density spectrum of form (6), then it can indeed be written as 
the first component of an n-vector Gaussian process X(t) that is Mar- 
kovian. When m = 0 in (6), by which we mean that the numerator 
shown there is a constant independent of w, the higher order components 
of X(é) can be taken as the derivatives of X(2), ie., Xj41() = d’X(t)/dt’, 
j = 1,2,---,n—1. For the more general case m = 1, see the article” 
by Helstrom. 

To indicate in full the quantities appearing in (22) and (23) in this 
Gaussian case, we introduce some further notation. Denote the n- 
variate Gaussian density with zero means by 


1 | ee 
W(X; 0) = On” |p E&P (-1 2d piles) (24) 


where 0 is a positive definite m X n matrix, the inverse of which has 
elements p;;. The right side of (23) is then given by 
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p(x) = W(x; «) (25) 
where q@ is the covariance matrix of X(), 1.e., 
a: = EX,(t)xX;(, 1,9 = 1, 22s 5M, (26) 
The kernel of (22) is given explicitly by 
Dely | X) = pr(x, y)/p(x) (27) 
where 
pr(X, y) = WZ; @). (28) 
Here the 2n-vector z has components 
Bi ee a) ae t S152. Siem (29) 
and 9 has the special partitioned structure 
“hp 
6 a 
where 
By; = E[LX,OXE + T)I, %#j=1,2,--+,n, (31) 


a is given by (26), and the tilde denotes transpose. 


We shall show in later sections how explicit series solutions can be 
found to (22) and (23) in this Gaussian rational spectrum case. But 
first some further preliminaries are necessary. 

3.2 A Generalized Mehler’s Formula 

When n = 1, (24) becomes the standard normal density 

| ee 





W(x) = e/a" (32) 
Denote its derivatives by 
¥i(z) = 4 Y(x), ~=0,1,2,---. (33) 
Now a = | and 6 = @ and (28) has the series representation 
Pra, y) = VF exp (ee) 
= 5 vernal), (34) 
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an expansion known as Mehler’s formula.” It is the power series for the 
normalized bivariate Gaussian density in terms of the correlation 8 
between the variables. We need the corresponding multiple power series 
expansion of the 2n-variate density (28) in terms of the correlations 
G;; of (31). The series is derived in Ref. 18: to present it, yet further 
introduction of notation is necessary. 

Boldface lower-case Greek letters, u, v, etc., will be used henceforth 
to denote matrices; boldface lower-case Latin letters, 2, m, etc., will 
denote vectors. If v is a matrix with n, rows and n, columns, we write 


r(¥) >= (1, Te 5 ty Pay 
ne 
rh= dv; @=1,---,m (35) 
j=l 
for the vector whose components are the row sums of vy, and we write 
c(v) = (C1 , Ce C 5p Cxg) 
nt 
Ce De g=l,-++ Mm (36) 
t=1 


for the vector, the components of which are the column sums of v. 
Throughout we adopt the convenient abbreviations 


u _ IT ui’, r = IT 
t,7 1 


ws JTast = Ti! 

ty] t 
fe) fe] eo 0 co co ce} 
v=0 ¥11=0 vi2e=0 Ynin2=0 l=0 11=0 in=0 


where the entries of u are u.; , the components of l are /; , etc. We call a 
matrix of nonnegative integers, such as ¥ in the last line of (87), an 
index matriz; a vector of nonnegative integers, such as J, is an index 
vector. The statement s < t means that no component of s is greater 
than the corresponding component of t; the statement s < t means 
s Stands +t. Inequalities between matrices, e.g., u < vare to be 
interpreted in a similar manner. Finally, we write 


[Ste le ee Pd (38) 
for the sum of the components of a vector, and we define 
gi 
xo) = SE 39 
Wi(X; 9) eae cena ¥(x; @) (39) 


where the Gaussian density (x; 9) is given by (24). 
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The desired generalization of Mehler’s formula is 


Pr{X, y) = a Wry (Xj @) Pec (Y; a). (40) 


v=0 
It is a multiple power series for the density of two identically distributed 
Gaussian vectors in terms of the cross correlations between the com- 
ponents of the vectors. 

The functions y;(x; «) defined in (39) that occur in (40) are closely 
related to the Hermite polynomials of several variables studied by 
Erdélyi’’ and others. We call )(x; w) an n-dimensional Hermite function 
of weight [l] [see (88)]. The following facts about them that will be of 
use to us later are established in Ref. 18. 

(7) Ifl,,i,,--- , l, are r distinct n-vectors with nonnegative integers 
as components, then W;,,(x, @), W(x, @), --- W(x, @) are linearly 
independent functions of x. Hermite functions have the generating 
function 

co yl 
Ve + tye) = 2 Wiles) (41) 


which is just Taylor’s theorem in many variables. 
(1) There are 
Non,p) = ("$2 — 3) (49) 


n-dimensional Hermite functions of weight p. Functions of different 
weight are orthogonal with respect to the weight function 


w(x; 9) = ao: (43) 


(zat) The scalar product of any two functions of the same weight p can 
be expressed in terms of an N(n, p) X N(n, p) matrix, 6,(9°'), known 
as the symmetrized Kronecker pth power of 9 *. We have the formula 


Fai Tai ia day ++- ‘a dx, Wi(X; 0)Ym(X; 9)w(x; 9) 


= 51tm61n(@ tm (44) 


where 6;; is the usual Kronecker symbol. An explicit formula for the 
matrix 6, is 


6,(0)im = Vil Vm! >> ‘ (45) 


u 
r(we=l 
c(w)=m 

tll=[ml=p 
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As indicated, the sum here is over all index matrices yu with row-sum 
vector 1 and column-sum vector m, these latter being of weight p. 


3.3 Series Solution of the Integral Equation 


We now return to consideration of the system of eqs. (22) and (28) 
where the density p(x) and p(y | x) are defined by (24) through (31). 
To simplify notation we shall frequently write ¥(x) for W(x; «), the 
unexpressed matrix always being «. Unless otherwise explicitly stated, 
boldface Greek letters will denote n X n matrices while boldface 
Latin letters will denote n-vectors. 

Equations (40), (27) and (25) show that the kernel of (22) can be 
written 


= 6" Vriw (X) We (y) 
a ee Sa 


the conventions (37) being understood here. This suggests a series 
solution to (22) and (23) in the form 


mi(x) = > 8"fevr Wild). (47) 


v,l=0 


Pry |x) = 


Conditions on the coefficients f,;,, are then obtained by substituting (47) 
and (46) into (22). There results 


pa Bf Wily) 
= a u! = Pe Nc 168Qi-1 sry) + fit eDieis sia (48) 
where 
Dees a de, - a dv, [. de, ee) (49) 
and 
dice = fo deg + [es fo de, HE (50) 


On setting u = v — 4, the right of (48) becomes 
. oe 
m{y) :- 28 du We(v—ay (Y) (v a 3)! 
: > eek eels sieeGe aii dest wed oat eaten (51) 


But this form shows that in (47) the 2 sum could be restricted to run 
from 0 to c(v). This in turn restricts the s sum in (51) to run from 0 
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to c(é). Equating cocfficients of powers of 8 on the left of (88) and the 
right of (51) gives 
c(v) 


> feivily) = me 3 ecw) 3 


c(v—w) 


[fia vr sQi-1 s r(w) is fist vu sPi+1 s eta) | 


s= 


where we have written 6 = v — u to reintroduce u. Using the linear 
independence of the W,(y), we find finally 


1 c(v—p) 
fivt = -) 2, [fey y—u sVi-1s rw + si vw Der ertnds 
u * gs= 


O<l< cl), i=0,+1,+42,---, (52) 


which holds for all v = 0. Here the sum is over all index arrays u with 
Os uS vandc(y) = 1. 

In the remaining paragraphs of this section we develop (52) to show 
how a recurrence scheme can be arrived at that permits successive 
determination of the f;,; . 

We note that from (52) 


c(v) 


Tivo = >» fava oeinis0. fisa vsPitis 0] 


s= 


= fi-1y0Qi-100 + fis1y Piti00 


c(v) 


5 d, [fea vsQi-1s0 5 tea ysPi+is ol, (53) 


where we assume v ~ 0. 
Again from (52) 


c(v—p) 


1 
fi- lvs = a2 7 >o! i 2 v—u tGi-2 t r(p) zs fi v—-u tPit rw] 


v 


where the sum on uw is over all arrays with u S v and c(u) = s. A 


similar expression can be written for f;.,,,. Replace the f’s appearing 
in brackets on the right of (53) by these expressions. There results 


fivo = i= 1v0Q9i-100 a ee 


c(v) c(v—p) 


zi 2d, qi- 1s0 eS > lity 


c(v) c(v—p) 


=. 2d Pi+iso ! j 2h it+2 tuv (54) 
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where we have written 


Nityy = fi-2 vu tQi-2t poe fi v-u tPit rip: 
Now in the first sum on the right of (54), 


e(v) 1 c(v—p) 


P= Yaaso dt dh ituy » 


substitute 6 = v — u to obtain 


c(v) c(6) 


Pe Sains eee ee 


where the middle sum is over all arrays 6 with O S 6 S v and c(é) = 
c(v) — s. Since s varies in the range 0 < s S c(v), however, 6 indeed 
ultimately takes on all values < v. Thus 


c(36) 
T= SS Gz ea Goal Qe fete ove 
Oss6<v =F 
Using a similar rearrangement for the last sum in (54) one finds finally 
1 
fivo = fi-1v0Qi-100 + fist v oPis100 + p> = 6)! 


c(é6) 


: 2d [fis 6 tA je fe ea fi é 1B; 6% =P fise ¢ tCrse v-6 t] (55) 


where 
A iut = Qi+icty) 09i rq t 
But = Gi-1 cq oPi rw t + Dist c(p) OF: r(y) t 
Cru = Pi-1 ctw) oPi rw) t + (56) 


If v = 0, the sums in (55) are to be interpreted as zero. 
The quantities A, B and C just introduced are not independent. 
On using (44) and the definitions (49) and (50), we find 


Diss + Qits = Stey,ts) T!S! ori (@ Dre - (57) 
Now sum (56) to find 
A wut i Bint oF Cut = Girw eLDist eae Gea oq ol 
De va) tLDi-1 et 0 Gi-1 ew) ol 
= CE i €or Dee t] duo 
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or finally 
Aiyt + Bint + Cine = Sy0 Sto - (58) 
We note now that the normalization (23) together with (47) yields 
> feet = 5,0 810; (59) 


a normalization requirement of the f’s. 
Now consider (55) when v = 0, 


f ioo = fi-1 0 o9:-1 000 + fi+10 oPi+100- (60) 
Since by (57) Dioo + Gioo = 1, (60) can be rewritten as 
f 1009 :00 > fea 00Pi+100 = at 0091-100 — f iooP i00 (61) 


which is to hold for all 7. Now pioo and gioo are bounded for all 27, and 
by (59) the f’s are summable. Both sides of (61) are therefore summable, 
and summing forz = 1,1 + 1,1-+ 2, --- , we find 


fi-1 0 091-100 — frooPioo = O (62) 
which holds for all /. In addition, from (59), 
a fioo = I. (63) 
These equations are readily solved by setting 
Wo = 1 
di-100 
UU; = W;-1 5 =, dh 2, 
, Pjoo ; : 
Wr=——w,, j= 0,-1,-2,--- 
Qi-100 
Ww; 
f ioo = Su; (64) 


With the fio now determined, we turn our attention to (55) for 
v ~ 0. Replace B; ,_¢ . there by — A; ,-6+ — Ci y-3 + as is allowed by 
(58). Multiply fio by 1 = pioo + Gioo and regroup terms to obtain 


Ui bY; = Usi-1 + Vi-2 (65) 
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where 
Us = fivodioe — fi+iv Pi+100 
1 c(é) 
= L7G (v is, 6)! ¢ pS [fisrAs ie EGS Coes t]. (66) 
Equation (65) is to hold for all 7 = 0, +1, +2, --- . The quantities 


Aiut » Ciyt » Dirs » Qirs are all bounded in 7. The f’s are summable by (59) 
and hence so are the u; and v;. Summing (65) for 7 = 1,1 + 1, 
there results 


Uy = (v, + 1-1). 
Using (66) this becomes 
ee v oPi+i 0007 fivoY.00 = di, (67) 
where 
c(6) 
di: — J 


O<6<v (v — ¢)! > fi-1 6 tAvayot 1 fistAr yo 


Te, feat 6 tCiss yv-s t fi+e 3 tCrse v—6 t]. (68) 


Suppose now that the d;, are known for 7 = 0, +1, +2, --- . If we 
can solve (67) subject to 


> tivo = 0 (69) 


as required by (59), our recurrence is complete, for the d;, depend only 
on the f,y,; with uw < v. Equation (52) for 1 > 0 permits computation 
of the f;,,,7 = 0, £1, +2, --- in terms of the fiy,, u < v. The values 
(64) start the recurrence off. 

Now the solution to (2) subject to (69) is quite a ee 


Introduce the notation £; = fi, 74 = 0, 1, 2, Ne = Jenost = 
0, 1, 2,° 
eo Q:-100 ve DP-(i-1) 00 
: P i00 ‘ qd ~+00 
D; = diay D; 22 ee dg . (70) 
P00 q-:00 


Equation (67) can be written 
fini = Viiarks = Diss ) t= 0, 1, 25 
Nit. = Visi: sie Dis1 , t= 0, 1, 2, ey 
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whence 

fo = No 
t t-1 a 

&£ = IL vi + > D7 [vit Di 
j=1 7=1 7+1 
a i-l t 

an =n IV; + DD; [[ Vi + DF t=1,2,-°-. (71) 
7=1 7=1 7+1 


Adding these equations we find 
Bt Diet Din Hel +V +7) + DL; Dp + DOL; D; 
1 1 1 1 
(72) 
where 
Vea Vit ViVE + VIVIVE + 
= > [] Vv? 
t=1 7=1 
ae A 8 We 
i=1 7=1 
y= V/Vi, = V/V 
L} = Qi - v/V}, 


But the left of (72) is the sum shown in (69) and hence vanishes. We 
have then 


Drag tn gers 
“SE 


with the quantities on the right given explicitly by (70) and (73). With 
fovo known, one can return to (67) in the form 


£o = fovo >< (74) 


1 : 
Fee vo = Lf is0Gi00 a diy]; CS 0, 1, 2, Tae 
Pi+100 
1 : 
tivo = [fier v oPis100 =x d:y|; .= ay —2, ce (75) 
di00 


to compute the remaining f’s recursively, or one can utilize the explicit 
solutions (71). 
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3.4 Recursion for the Pirs 


The formulas just developed for computing the coefficients f;,, 
involve the quantities p;,, and q.;,,. The latter are given in terms of 
the former by (57). We turn our attention now to a recursive method of 
computing the p’s. 

From the generating function (41) we find 


vVae+tOvatn yen’ ¥,@)y¥<(x) 





W(x) r!s! W(x) 
= exp (>> ai én;) 
exp [—4 » as; (a +& + ade; + § + 15)] 
(2) | @ : 





(76) 


where ~ and n are n-vectors. Recall now the definition (49) of piss. 
Integration of (76) then gives 


ety? /as 1 


Sa = exp (2) ai; &;) | —==—= dy 
ris! ne i aa — oo V/ ras: 





exp (>) aijEims) F(a; + & +m). (77) 


Take the partial derivative of this relation with respect to &;, 7 > 1, 
to obtain 


phe ae a 
> me Pirs = DS ast Nk » , | Pirs 
rs k=1 T:Ss! 


ris! 
or 
Die = De esapiehestetocuie concen ; 43> 1. (78) 
A similar formula, obtained by differentiation with respect to 7, 
Pirs = De eat ics ei albaaaih ) k>1 (79) 


also holds. 

Repeated use of (78) and (79) permits one to express p;;, as a linear 
combination of the quantities p;,00 --. 03,00 «» 0 Where 0 S #; S r, and 
0 < §, S s,. Let us now define 


A — 
Pris = D ir100*++05100°**0 
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Deer am un 3! Pris, = exp (tayéim)F@; + & + m) (80) 


with F defined by (77). Without loss of generality, we take a,, = 1 
and note that 


ai .—t?/2 


Boo = J a= dt (81) 


Now differentiate (80) with respect to £, to obtain 
-1 n’ grates +E+7)? 


pa oz 3S = s! Pre = Ont 0 ye rll fh, -++ exp Cae <= (82) 





where we have dropped some unnecessary subscripts. Let 


—hlaitE+n)? 


exp (ai; £7) ayes 2 fr il it - (83) 
Equation (82) then gives 
Perr. = @8P, .-1 + M,, 
and its symmetric version obtained by interchanging the roles of r and s. 


These equations yield 


A _ M, st+1 mM, r+1 
Prs — ann ( == 8) ) r a s 
Pre o, nT Py-1 r—1 Pp M,-1 rs (84) 


To complete the recurrence we must have rules for generating the M’s. 
Differentiating (83) with respect to & gives 


Myst b= aM j:, _ el _ a kM; k-1 gM ;-1 ke (85) 


which permits reduction on j, so that M/;, can be expressed in terms 
of My with 0 < k’ S k. But from its definition, M;, = M),; and from 
(85) we deduce 


Moz = =F aM, k-1 7 (k — 1)M, k—-2 (86) 
Finally, we find 


~az2/2 


Moo = 





—aj?/2 


Von 


| ® 
ae A 


Mo = (87) 
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3.5 The Mean Squared Error 


The mean squared error of the delta modulator running in the steady 
state is defined by 


c= a] aexe - 207 (88) 


where the expectation is to be taken using the steady-state distribution 
of X(¢) and Z(t). Then 


T oO 
é=5f ax] aly -a¥Pey,9 (89) 
0 z —0 
where 


P,{y, )dy = Pr {fy S$ X®) <y + dy, 2 =a,} 


and so 
Pay : dit, *** / dary f dxym;—1(x)B.(y | x) 


4 / (ere / de; | deymix@B.ly |x). (90) 


In this last equation #,(y | x)dy is the conditional probability that 
y = X(t) < y + dy given that X(0) = x. The expressions (89) and (90) 
can also be found easily from the alternate definition 


1 (274+2)T 
Pei / dX) — Z(DP 
j00 2T 271T 


where the expectation is over the actual time varying distribution of 
X(t) and Z(t), not the steady-state distribution. We proceed now to 
express ¢’ in terms of the f;,;. 

The integration on y in (89) can be carried out directly. Using standard 
formulae for Gaussian variates, one finds 


E(X(i) | XO) = x] 


[ dyyp.ly | x) 


> CjX; (9 1) 
1 
where 


¢ = Da Buld), (92) 
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and where we now exhibit explicitly the time dependence of 6;;(t) = 
EX (u)X,;(u + 0). We find further that 


BUX) |X) =x) = [ dyy'py |») 


n 


n 2 
= an — 2) a'Bal)Bn) + ( X ee] : 
t,7= a 
With these results, (89) and (90) can be rearranged to give 


T c) 
e = j dt pS i dt, ++ | daym ;—1(x) 


+ fo dz [ aezmses(&) A +iB+#A7] (98) 
where 
A =a, — 03; Ba()Bj(b) a » CiC;X,X; — 2h oy C2; + h? (94) 
and 


B = —2A > c,x; + 2Ah (95) 
are independent of the index 7 of (93). Now 


ae dz, | dxym;-(x)A 
+ Of are [ deme@a = | dxy@ 
by (23). But 
J cxxy@ = 0, [drew oe) =a, fave) =1, 


so that one finds finally 


[i axp@ A = an — Dak'BalOBa) + Deseyas + 


Qi + h (96) 
on using (92). The mean squared error can thus be written 


e _ a:.¢h+i4+71, (97) 
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where 
h=>d | a dew | © dxym,-s(8) 
+ i da, -*: / ~ dogo) [2a + 7A’) (98) 


and 


T c-) 0 
i — i i dt 2d i AL, ee [- dxym;-1(x) 
+ i Atn + i - degmins) |[—2i8 D> C;%;\. (99) 


The expressions (98) and (99) can be reduced further by using (47). 
One finds directly, for example, that 


I, > Do BT) Uf i-1» 1@i-1 10 + fist y Dis1 rol[20Ah + v A’] 
D BL) DO fiols2h + 7A’). 


Here we have used (49), (50) and (52) with 1 = 0. To reduce J,, we 
first note that from (24) one has 


1 ay _ “1 
W 82m du al 


From (92) we thus obtain 

= ah oy 
De 6; a) 2 B;1(t) an; 

Using this result and (47), we find 

a y 

b=7] ¢ Teo 
0 t vl 
: 2; BrOlfi-rv 1Gi-1 1; 4+ fisa y Misi 1€;]20A 


where e; is the vector having unity for its jth component and zero 
for all other components. Finally defining 


n 1 T 
Qn = Late gf an 
7=1 0 


n 1 T 
P31 De Pate: 7 i dtB;1(t) 
j=1 0 
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we have our desired result 


f= an th + BL) Dd fewli2h + 7A’ 
+ DBD) Do fers Qerr + fier Pier 24M. (100) 


IV. GENERALIZATION TO SYSTEMS WITH REALIZABLE FEEDBACK FILTERS 


4.1 Description of the System 


In this section we consider the modulator of Fig. 2 where the feedback 
filter is a realizable one with a rational transfer function. Again we 
assume that the input X(¢) to the modulator is the first component 
of a vector Markov process 


X(t) = {Xi = XM), Xe, --- , X.(} (101) 


with X(t) normalized as in (5). We shall show how an integral equation 
(131) that generalizes (22) can be written for the steady-state prob- 
ability distribution of this system. 

Let us first describe the system more precisely. The sampler acts at the 
instants kT, k = 0, 1, 2, --- , and its output at time j7 is X — Z 
where we write 

Xi? = XP) 

Z® = ZGT—) = lim ZT — 6, e>0 

‘ «0 


i=1,2,---,n, 7=0,1,2,---. (102) 


This output is acted upon by the quantizer, which at time j7 produces 
an impulse of magnitude U; that is applied instantaneously to the 
filter. We suppose a K-level quantizer with representative values 


@,, G2, *** , a, and decision regions ®,, R2, °°: , Ry. Thus, 
U; = a; if x = fs eR; 
be Li De ere GK, j3=0,1,2,---. (103) 


The @’s are disjoint sets, the union of which exhausts the real line. We 
suppose the filter described by a real impulse function h(r) with 


hr) = 0, 7 <0, (104) 
and that the filter output is 
Z)= DU AE - kT), iT St< Gtr 
k=0 
7=0,1,2,---. (105) 
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Iinally, we define 

Z® = Z0—) =0 (106) 
and suppose the system inactive before time ¢ = 0. Thus the filter 
input and output are zero for allt < 0. The system starts up at ¢ = 0 
when U,, which depends only on X(0), is applied as the first input 
to the feedback filter. 
4.2 The Markov Nature of the Filter 


Suppose now that the transfer function of the filter is rational, 


_ ky * tor P(w) 
Rr) = 5 a aes (107) 
where P(w) and Q(w) are polynomials in w. Let the degree of Q be m 
and denote its roots by 7o;, 7 = 1, 2, --- , m so that 
Qw) = d I] & — ia,) (108) 


where d is independent of w. For simplicity we shall assume that all 
m roots are distinct and that none are also roots of P(w). Expansion 
of P/Q in partial fractions shows that 


~ i) i ToT A; 
Mr) = ge [. ise 


w — 10; 





= Aid; , 7) (109) 
1 . 
where 
f(o, 7) = ‘ . ae (110) 
0, 7 <0. 
lor convenience, we define f(c, 0) = 1. Here we must have 
Re (¢;) > 0 f= 1,2, °* mM (111) 


to insure (104), while the reality of h(r) requires that non-real o’s 
occur in complex conjugate pairs. The corresponding A’s of cach such 
pair in (109) are complex conjugates of each other. 

Now it is well known and easy to establish that when such a filter 
is excited by impulses as in (105), its output at all times can be given 
by the first component of an m-dimensional state vector 


Z(t) = {Zi() = ZO), Zo(t), +++ 5 Sn), (112) 
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that satisfies the equation 
ZT +&) =D©Z” + Uh@), (113) 
0OSt< 7T j = 0,1, 2, - 


Here the m X m matrix D(€) and the m-vector h(é) are independent 
of j. The time-discrete state vector 


Z‘? = ZGT—) = lim ZjT— 60), ¢€>0 (114) 
0 
satisfies the recurrence 
VAseae = DZ” ae Ujh (115) 
where 
D = D(T), h = h(7), (116) 


which follows from (118) by letting —& — T. 
The validity of (113) can be established in a few lines. Let 


h(r) = {hi(r) = (zr), her), ++, hn(r)} (117) 


be the m-vector, the /th component of which is 


hilt) = 0 Ajo; "f(o; 5 7) 
1 
= D tule; ) t) l= if 2, Pe ip HE (118) 
1 
where 
Cj; = Ajo, '. (119) 
lor p = 0,1, 2, --- , one then has the system of equations 
hip Dy ge SS eee com (120) 
j=1 


which can be solved inversely to give 
ei?" = Dy CrllpT), i] S Leah s a Me (121) 
k=1 


Here cj, is an element from the matrix inverse to c = (c;;). The latter 
is non-singular since its determinant as computed from (119) is 


lel = (IT 4.) I - on 


< 
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which is not zero by our assumption of distinct roots for Q(w). 
Now from (120) and (121) it follows that for > —pT 


m™m 
ene Fe? 
7=1 


I 


hi(pT + £) 


= 2 oie 2d cjrh(pT) 


= > di(E)hi(pT) 
or, in vector notation, that 
h(pT + &) = D(@)h(pT), pT +é&>0 (122) 
with 
D®= di), da = Dene “ey. (123) 
k=1 


This is the key to (118). We now define 
Zi) = DU R(t - kT), jTst<Gt+UT 
k=0 


p SO 12 2 (124) 
which has (105) for its first component. Then 


Z(pT +) = ye U,h(pT — kT + &) 


_ 


p- 


= Ul — HT + 44+ Uh} 


k 


u 
° 


= Dif) > Uh{(p — 1 — WT + T) + Ube) 


= D(&)Z(—pT—) + Uh) (125) 
by (122). But this is (113). For this equation to hold for p = 0, we 
must define 

7 = 0: (126) 


4.3 The Integral Equation 


From (115) and (102), it is seen that Z“*” can be defined in terms 
of the random variables X‘” and Z“”. Since X“” is assumed Markovian, 
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it readily follows that the m + n quantities 
xi, a, acess Xa Zs aS aes a (127) 
constitute the components of an (m -++ n)-dimensional time-discrete 


vector Markov process. Denote by m (x, z) the joint density of 
x? and Yas 


n m 
m (x, z) [I dx; [I ae; 
t=1 j=l 
= Pr{x< X)? Sa,+dxy,---,2, SX. S24 dz,, 
21S Zi? Sa tda,-+: 52 SZ San t+ den}. 


Then 


m?(x’, 2’) 


[ax i dzp(x’, 2’ | x, z)m°~"(x, z) 


where p(x’, z’ | x, z) is the transition density for the process (127) 
and is independent of 7. The steady-state distribution m(x, z) for 
the process must then satisfy 


m(x’, z’) = [x / dzp(x’, z’ | x, z)m(x, Zz). (128) 
For the case at hand, the transition density takes a very special 


form. Let 


1, (2, 1 21) eR; 


x, (x, Z) = i ine ace me (129) 


0, (a, +7 21) ¢R; 


describe the quantizer decision regions. Then from (113) and (103) 
we find that 


K 
p(x’, z’ | x, z) = >) x(x, z)pr(x’ | x) 62’ — Dz — ah) (180) 
t=1 


where 6 is the usual Dirac symbol and as in (110) p,(y | x) is the prob- 
ability density of X(¢ + +r) given X(é). Inserting (130) into (128) and 
carrying out the z-integration gives the desired integral equation 


| D | m(x’, Dz) 


K 
=> i dxx (x, Z — a; D"h)p7(x’ | x)m(x,z — a; Dh), (131) 
t=1 


2136 THE BELL SYSTEM TECHNICAL JOURNAL, DECEMBER 1972 


with |D| the determinant of D. This equation is to be augmented 
with the condition 


| m(x, Z) dz = p(x) 


with p(x) as in (111). 

We have not seen how to solve (131). The simplest example’ occurs 
when m = n = 1. We then have RC noise for the signal and an RC 
filter with impulse response h(r) = e *, r > O, say, in the feedback 
path. Taking a, = A, a, = —A and @, the positive axis gives for (131) 


oe) zt+A 
yma’) = [ dem(e,2— Ape’ |2) +f — dum(x,2 + Ape" | 2) 


—oT 
wherey =e”. 
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Conditions of High Gain in Mixers and 
Their Relation to the Jump Phenomenon 


By C. DRAGONE 


(Manuscript. received June 21, 1972) 


A study of the stability of periodically driven nonlinear networks (mirers), 
motivated by recent work on low-noise down-conversion with Schottky 
barrier diodes, is presented. Necessary and sufficient conditions for the 
unconditional stability of a mixer are derived and discussed. It 1s shown 
that potential instability is always associated with the jump phenomenon 
in the sense that a mixer will (under suitable circumstances) exhibit the 
phenomenon af, and only tf, the above stability conditions are violated. 
Application of these conditions to frequency multipliers is also discussed. 


I, INTRODUCTION 


The Schottky barrier diode down-converter is a frequency converter 
that is capable of noise figures below 1 dB in the microwave range with 
operation at room temperature.’ However, this converter is potentially 
unstable, i.e., is capable of arbitrarily high conversion gain. Evaluation 
of the noise performance at high gain requires a knowledge of the 
mechanism of instability, and of the conditions necessary and sufficient 
for instability. Torrey and Whitmer’ derived a simple stability condition 
assuming weak reciprocity and also studied a particular case in detail, 
but their results are not applicable to the down-converter of Ref. 1. 
Here we derive general stability conditions, in closed form, and show 
that instability is intimately related to the jump phenomenon, a type 
of instability peculiar to periodically driven nonlinear networks. These 
stability conditions are applicable to any periodically driven nonlinear 
network (henceforth simply called a mixer) provided it is driven by a 
source (pump) that generates power at a single frequency w, . Because 
they are very general, these conditions can be used for a variety of 
purposes; for instance, suitable design criteria for harmonic generators 
can be determined in order to obviate the jump phenomenon and related 
instabilities in these devices. A brief discussion of this application is 
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given in Section 3.2 following a detailed discussion of the stability of 
the Schottky barrier diode down-converter in Section 3.1. 

Mixer stability is reexamined using the method of Torrey and 
Whitmer’ in their phenomenological theory of frequency conversion. 
This method determines the small-signal terminal behavior of a mixer 
at the input (w, + p), image (w, *F p), and output (p) frequencies 
without any knowledge or assumptions regarding the internal structure 
of the mixer. The method requires that the output frequency p be 
very small, in which case the behavior can be derived directly from the 
terminal behavior of the mixer at de and at the pump frequency », . 
No other assumptions are made. 

The small-signal terminal behavior of a mixer can be represented by 
a nonreciprocal three-terminal-pair network, but no simple stability 
criterion in closed form is known for such a network; Ku’ has resorted 
to graphical and numerical methods. However, because p is assumed 
small, the three-port assumes special properties that permit study of 
its stability analytically. 


II. THEORY 


2.1 Description 


A mixer that is potentially unstable can exhibit the jump phenomenon 
and, vice versa, a mixer exhibiting this phenomenon is potentially 
unstable. To acquaint the reader with our definitions and notation we 
begin in Section 2.2 with some preliminary considerations, including 
derivation of a result of Torrey and Whitmer (Ref. 2)* [eq. (11)]. In 
Section 2.3 conditions necessary and sufficient for avoiding the jump 
phenomenon are derived. In Sections 2.3.3 and 2.3.4, these conditions 
are shown to be necessary and sufficient for unconditional stability of 
the mixer; their sufficiency is shown by proving that if they are fulfilled 
the mixer has passive behavior at w, + p, provided it is terminated 
in a passive impedance at p. In Section 2.4, it is pointed out that because 
of such behavior at w, + p, a certain type of interconnection of stable 
nonlinear networks is unconditionally stable. 

In Section 2.5, we introduce the concept of a stable nonlinear im- 
pedance, and discuss its significance. 


2.2 Preliminary Considerations 


Suppose the mixer is represented (Fig. 1) by a two-terminal-pair 
network M with two ideal filters F, and F, permitting currents to flow 


? We are indebted to H. E. Rowe for suggesting including derivation of eqs. (11). 
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NONLINEAR 


NETWORK 





Fig. 1—Periodically driven nonlinear network M, 


only in narrow bands centered about w, and de. This network is assumed 
to be nonlinear and to contain no sources of energy. 

Let the de and sinusoidal terminal currents be J, and z(t), the latter 
with complex amplitude J, 


a(t) = 2 Re (Ie'**’); (1) 


a periodic steady-state is assumed. Let v(t) and V, be the terminal 
voltages arising at w, and de, and let V denote the complex amplitude 
of v(2), 


v(t) = 2 Re (Ve'*"’). (2) 


Both the de voltage and the impedance presented by the network at o, 
are functions of J, and | J | . We write 


Vz = UE, , | Fi) 








(3) 
V = 00., 12 |)-l. 
It is convenient to choose the time origin so that 7(¢) is a cosine wave, 
a(t) = 2I cos wt, r=|I|, Z2=0. (4) 
If we superimpose small perturbations 6/, and 
éi(t) = 2 Re (dT e’*"'), (5) 
on J, and z(t), and note that 
|\l + 62| =1+ Re ot (6) 
because J is real, then eqs. - lead to the variational relationships 
6V, = Re 61 
aa 0 7 T a|T| (7) 





sv = 5p él, lata 


* This assumption is not used in the following section. 


Re 61 +461, for L=|I| 
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where 6V, and 6V are perturbations on V, and V. 

When the network of Fig. 1 is used as a mixer, a small signal is 
applied at the input frequency (w, + p, or w,; — p), and terminations 
are provided at the output frequency (p) and at the image frequency 
(w: — p, or w, + p). The input signal causes small perturbations at 
frequencies w, + p and p to appear at the terminals of M. We wish to 
derive from eqs. (7) the relations among the various frequency com- 
ponents of these perturbations. Equations (7) hold without change 
even if 6J and 6/, vary with time, provided the variations are very slow. 
Let 

















51 = l(t) = I,e' + I,e%”' 8) 
62, = 61,(t) = Ige'* + Ite”. 
Then the terminal currents of M become 
a(t) + Si(t) = QRe(Tei**! + Tei??? + Tye") 9) 
I, + a1,(t) = I, + 2Re(I,e'”'). 
Substituting eq. (8) in eqs. (7), after replacing J with | J | , 
= —_ ipe —ipt 
6V = 6V(2) Ve” + Vie | (10) 
5V, = 5V.(i) = Vee’! + Vie", 
where 
ry 4 1 03 9 17) _ 98 
Vey Nero aa be lap ple ear. ‘|e 
1 90, av, 1_av, 
Male 23/1 al, 29/1 ts 
1 0a* 0a* 1 03* 
* = ese pecan * ia nt eeeeeey * 
LV? aI lsTr El ar oa ae gl Ty 


(for J = | ZI |). (11) 


Thus we have the Torrey and Whitmer result.” Note that according 
to eqs. (2) and (10) the terminal voltages produced by the currents 
of eqs. (9) can be written 


v(t) dv(t) = 2Re(Ve'**" + Vg) Sete fe ae amd 
V. + 6V,(t) = V, + 2Re(Ve'"). 


V., V, and Vz are the complex amplitudes of the terminal voltages 
at w, + p,w, — p and 7, respectively. 


(12) 
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Equation (11) describes the mixer performance subject to three 
assumptions 


() Quasi-static (p small) 
(it) Small-signal (| 62 | «| J|, 6l, « | J, |) 
(i2t) Zero phase for I [eq. (4)] (not restrictive). 


The matrix elements in (11), and hence the performance of the mixer, 
depend exclusively upon the same coefficients 3, | I | (03/0 | I |), ete. 
that characterize the small-signal terminal behavior at w, and de 
[see eqs. (7)]. 


Stability: The network M is unconditionally stable, for a given steady- 
state condition, if the powers 


ReValz), Re(Vsls),  Re(V 7) (13) 


absorbed at w. + p, p and w, — p cannot simultaneously become 
negative. On the other hand, if 


Re(V.1*) <0, Re(V,It) <0, Re(V,I*) <0 (14) 


simultaneously, then M is potentially unstable. In this case, spurious 
oscillations at w, + p and p can be produced (without sources at these 
frequencies) by terminating M with appropriately chosen impedances 
at w, + p and p. A potentially unstable mixer can have (in principle) 
unlimited conversion gain. 


2.2.1 The Jump Phenomenon* and Stability 


Now suppose a one-terminal-pair network N is constructed by con- 
necting M to a linear circuit consisting of a fixed resistance F, in series 
with a constant voltage K,, as shown in Fig. 2a. For given values 
of R, and FE, , the impedance 


Y 


Z=R+jX =F 


(15) 
presented at w, is now a function only of the magnitude of J. Let us 
connect in series to this network a linear passive impedance Z, as 
shown in Fig. 2b, and let # denote the complex amplitude of the voltage 
e(t) arising at the terminals. The behavior at w, is described by the 
equation H = [(Z + Z,). Thus, if R, and X, denote the real and 


+ This phenomenon is discussed in various texts on nonlinear differential equations 
(e.g. Ref. 4) for systems governed by Duffing’s equation. 
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i(t) = 2Re (Ie! 1") 
———_» 





e(t) = 2Re (Ee! @1") se 


(b) 


Fig. 2—Networks consisting of (a) M connected to a de supply, and (b) M con- 
nected to a dc supply and a passive impedance Z;. 


imaginary parts of Z,, we can write the following relation for the 
magnitude of # 


|E| =|I| V@, +R) t+ (X%, + XY)’, (16) 


where it is important to keep in mind that R and X are functions of | I |. 
The form of these functions depends, as seen from Fig. 2a, upon the 
values of R, and E, and the behavior of M. 

The jump phenomenon occurs when | £ | is not a strictly monotonic 
function of || (.e., when | | has a negative slope for some | I |). 
For instance, suppose that | #| has the behavior of Fig. 3, and let an 
ideal voltage source with zero internal impedance and variable | Z| be 
connected to N as indicated in Fig. 3. Then, if | #| is gradually in- 
creased, starting from | #| = 0, | Z| will increase smoothly until it 


Ie] 





Fig. 3—Jump phenomenon. 
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reaches a critical value for which 


| E | 
dlT] 0. 
At this point | Z| will suddenly jump to another value*, as indicated 
in Fig. 3. If | Z| is then decreased, | J {| will decrease smoothly until 
another jump occurs, as shown in Jig. 3, ford | #|/d|I| = 

The following stability criterion, the validity of which will be proven 
in the following two sections, plays a central role in this paper. 
Stability Criterion: Suppose one wants to determine whether or not M 
is unconditionally stable for a given steady-state condition. Assume in 
Fig. 2b that R, and Z, are arbitrary, but that /, and | J | have been 
chosen to produce the given steady-state condition in M. It will be 
shown that M is unconditionally stable if, and only if, the following 
property is obeyed: 


d\E| 
d | I | 


== 


>0O forall R, R,20, and X,. (17) 


This result implies that, if for a given steady state M is uncondi- 
tionally stable, then discontinuous jumps from the steady state in 
question cannot occur, no matter what the values of R,, R, and X, 
may be. If, on the other hand, M is potentially unstable, then the jump 
phenomenon can be produced by certain choices of R,, R,, and X,. 


2.3 Stability Criteria 


In the first part of this section we will show that requirement (17) 
demands that the behavior of R + jX as a function of | J | satisfy 


the inequality 
d(| I | R) ( dX iY 
caer a a > parapa (18) 





Since the derivatives of R and X in this inequality depend not only 
upon the properties of M but also upon the value of R, , this inequality 
must be fulfilled for all R, 2 0. In Section 2.3.2, we determine the 
relationship between #, and the derivatives of R, X, and show that if 
inequality (18) is fulfilled in the two particular cases 


Rae (19) 


t Assuming, of course, that for the steady-state condition corresponding to this 
new value of || the circuit is stable, and that. a transient leading to this new condition 
(from the unstable condition) exists. 
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and 
B= 0; (20) 


then it is in general also fulfilled for all positive R, (‘‘in general” means: 
except when 00,/dJ, S 0, as we shall see). Furthermore, it will be 
shown (see Section 2.3.2) that in these two cases inequality (18) becomes, 
respectively, 








a(| I | &) : (=) 
4K TT] a aucun (21) 
and 
ap 20» | 2. AJIT |R) dv. al FL) | 
al, Lal, a|I|  a|I| ol, 
ax dv, ax av, \’ 
> (11 | st Be | Earral oe 


where the functions R = ® (J,, |Z |) and x = X (J,, | Z|) are the 
real and imaginary parts of = 3 (7, , | I |). At the end of Section 2.3.2 
we will find that for requirement (17) to be fulfilled it is necessary and 
sufficient that the above two inequalities be fulfilled, and that 
OU, 

R> 0, on > Os (23) 
In Sections 2.3.3 and 2.3.4, these inequalities are shown to be necessary 
and sufficient for unconditional stability of the mixer. 


2.3.1 Significance of Inequality (18) 


For a given steady-state of M, and given values of R, and £, in 
Fig. 2, we wish to show that the requirement 


d|£ | 
d|I| 


is fulfilled if, and only if, R > 0 and inequality (18) is fulfilled. 
First, note that if R < 0 then requirement (24) is certainly violated 
because one can verify, using eq. (16), that d | # |/d|I| = 0 for 
dx 
R, = —R, Pee anes ENG ee 
Thus for fulfillment of requirement (24) it is necessary that R > 0. 
Next, we show that if R > 0, then for fulfillment of (24) it is necessary 





>0O forall R, 2 0, xX, (24) 


STABILITY OF MIXERS 2147 


and sufficient that inequality (18) be satisfied. We begin by noting 
that requirement (24) is equivalent to 


[HE |d|E | 
[2 jd |7| 








>0O forall R, 20, X,. (25) 


(Note that | #| # 0 because R > 0.) Let us therefore examine the 
dependence of the ae 
[E 1_ az |’) 
| [I | @| Z| 


: 








bo 


upon X, and &,. If one calculates this quantity, using eq. (16), it is 
found that its minimum value as a function of X, occurs for 


ee! a7 20] 
while its minimum value as a function of R,? oceurs either for 
os wl ate) 


or for R, = 0, according to whether the value given by eq. (28) for R, 
is positive or negative. In the former case one finds, using eqs. (16), 
(26), (27) and (28), 


Hide = ffir) oie GP] 
(ST) AVON ppg el arr 

Thus, requirement (24) is surely violated if the value given by eq. (28) 
for R, is positive. To satisfy (24) it is therefore necessary that (28) be 
negative. That is, it is necessary that 


d(| I | R) 


> 0, (29) 


in which case one has 
(2la le) -eMA1B _1) 7p (42) 
[L | d| I [/min d| I | 4 d|I| 


This expression is positive if (and only if) inequality (18) is satisfied. 
Thus, requirement (24) is fulfilled provided inequalities (29), (18) and 
R > 0 are satisfied; since the first of these is implied by the latter two, 


t Note that we assume R, = 0. 
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it is necessary and sufficient that only inequality (18) be satisfied, 
and R > 0. 

2.3.2 Derivation of Inequalities (21), (22) and (23) 

We can write 


dX ox di, a. ox 
d|I| ol,d|I|° a|I| 








(c=axi,, |Z |). (30) 


An analogous expression can be written for d (| I | R)/d |Z] [substi- 
tute X — | J | R throughout eq. (30)]. Thus, we can write for inequality 
(18) 


aj Z| @) , aj 7a) ae. | 
sn| 04 er maa ire 
»{ ax , ox di, | 
re st +e ath Oe. 180) 


From Fig. 2a, U,(7., | Z|) + RJ. = H,. Differentiating this relation 
we obtain 

















dv, dv, a 
aT. Ue + arpa i | + Re dl. = 0. 
Thus, 
dl, _ —-dv,_—*«| OU, re 
fi = [2 + 2, | 2) 


Using this relation we obtain from inequality (18) in the two cases 
R, = © and R, = 0, inequalities (21) and (22) respectively, as stated 
at the beginning of this section. 

The conditions necessary and sufficient for fulfillment of inequality 
(31), for all R, = 0, are obtained by noting that this requirement 
demands that 


Ov, 
al, 


because the magnitude of d7,/d | I | becomes infinite* (and consequently 
inequality (31) is violated) for R, & —(dV,/oel,). Therefore let dv,/dl, 
> 0. Then dJ,/d |I| is (for R, = 0) a continuous function of K, , 
and it varies from the value 





> 0, (33) 


* We assume (0Z/0/,)-(0V./da|I|) # 0. One can show that the necessary and 
sufficient conditions for the stability of M/ are also given by inequalities (21) through 
(23) in the special case (0Z/d7,)-(0V,/a|I|) = 0. 
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Ov, OU, 
mee ial 








to 0, as R, varies from 0 to «. If y denotes the left-hand side of in- 
equality (31), and x denotes d/,/d | I | , we have from inequality (31) 
ax |” 
al, 


ay _ 
da” 





ma); 








It follows that y cannot have interior minima in the interval 6 S z S 0; 
therefore the lowest value attained by the left-hand side of inequality 
(81), as dI,/d | I| varies from 8 to 0, must occur at one of those end 
points. Since those two points correspond to two cases R, = 0 and 
R, = ©, one concludes that if inequality (31) is fulfilled in these two 
cases then it is also fulfilled for all R, > 0. The conditions necessary 
and sufficient that inequality (31) be fulfilled for all R, > O are, there- 
fore, inequalities (21), (22) and (88). 


2.3.3 Necessity of Inequalities (21) through (23) for the Unconditional 
stability of M 


We have derived inequalities (21) through (23) from the behavior 
of the network of Fig. 2a at w, , by requiring Z to satisfy inequality (18) 
for all R, 2 0. Alternatively, these inequalities could have been derived 
from the de behavior of the network of Fig. 4, by requiring that the 
derivative of V, (with respect to J,) be positive for all passive Z; .* 
In fact it is shown in the Appendix that this requirement and require- 
ment (17) are equivalent; this implies that if inequalities (21) through 
(23) are violated, then by properly choosing Z,, one can make the 
network of Fig. 4 exhibit a negative differential resistance at de as 
illustrated in Fig. 5. Since such a network is potentially unstable, we 
conclude that imequialities (21) through (23) are necessary conditions 





Fig. 4—Network consisting of M driven by a pump with internal impedance 2. 


* This requirement is discussed in Ref. 2. We have chosen to oe our stability 
conditions from requirement (17), rather than the requirement dV,/dI, > 0, because 
one of the purposes of our derivation is to point out the relation between inequalities 
(21) and (22) and inequality (18). This relation is essential for the proof in the follow- 
ing section. The significance and practical importance of inequality (18) is pointed 
out in Section 2.5. 
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Fig. 5—Example of a de characteristic with a negative slope. 


for the unconditional stability of M. In the following section it is shown 
that they are also sufficient conditions. 


2.3.4 Sufficiency of Inequalities (21) through (28) for the Unconditional 
stability of M 

In Fig. 2a, assume that the internal impedance of the linear circuit 
connected to M is, instead of a frequency-independent resistance R, , 
a passive impedance Z,(w) with the arbitrary value Z, at w = p. Let a 
small perturbation 67(¢) containing the frequencies w; + p be super- 
imposed on the terminal current 2(¢) of this network, as shown in Fig. 6. 
According to the definition of Section 2.2, M is unconditionally stable 
if it is impossible that 


Re [I,V%] < 0, Re [1,V*] < 0, (34) 
simultaneously. Recall that J, , I, ,V.,. and V, are the Fourier coeffi- 


cients of 67(f) and dv(¢) of w, + p:p [see eqs. (9) and (12)]. In this section 
we show that inequalities (21) through (23) guarantee 


Ret Vea 7 Ve) 0. (35) 
An obvious consequence of this result is that Re [1,V#] and Re [I,V*] 


cannot simultaneously become negative, that is, M is unconditionally 
stable if inequalities (21) through (28) are fulfilled. 





i(t) + 6i(t) ——> 


v(t) san 


| eee eee npee m 


Fig. 6—Network N with perturbations at. #: + p and p. 
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Theorem:* If inequalities (21) through (23) are fulfilled and Re (IgV) < 0 
then necessarily Re (I,.V* + I,V*) > 0. 


Proof:’ The relations imposed by M among J, , Ig, I, , Va, Vg and V, 
are given by eq. (11). By using the constraint Vz; = —Z,I, , which is 
imposed by the linear circuit at » = p, one may easily eliminate from 
eq. (11) the variables Vz and J, , so as to obtain the following relations 
among J, ,/,,V,.andV,, 




















a at (36) 
Vy ry 
where 
hii siiBe, 
03 
fh a mete emcee 
Zeai'= a ats (37) 
1 03 
sini air eel, 
+4 — 
80, oF 


Condition (85) demands that [Z ote + [Z.,,]* (the superscript ( )t 
denotes the Hermitian conjugate) be positive definite. If we introduce 
the new quantities 


oe -y — ULI8) , 118) 
Zu ise R, Si jXu — FS) | I 7 ae al, é (39) 


then from eq. (87) we obtain 


t R,+a—-xX, &-—A+ 7K, 

[Zev] + [Ze] = « 41) 

R= RHR REG +X, 

* Note that the condition Re7J.V,* + IyV_*) > 0 if Re(IgV,*) < 0 is more 

ee than the condition imposed by the requirement of stability (see Section 

2.2.1 

’ Throughout the proof it is assumed implicitly Re(7gV,*) < 0, since the impedance 

4g = Z,(p) in Fig. 6 is assumed passive. 
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One can verify that this matrix is positive definite if, and only if, 
Rk > 0 and 


4R,A — |Z, |? > 0. (42) 


Thus, in order to prove the above, theorem one must show that in- 
equality (42) is satisfied if inequalities (21) through (23) are satisfied. 

Comparison of (88) and (82) shows that for Z, = R, the quantity 
£ appearing in eqs. (89) and (40) equals dJ,/d |I| . Furthermore, 
if & = dI,/d |I| , then inequality (42) reduces to inequality (31), as 
one can verify using eqs. (89) and (40). Thus, for Z; = R, , inequalities 
(42) and (81) are equivalent. It follows that if inequalities (21) through 
(23) are fulfilled then inequality (36) is certainly satisfied for X, = 0. 
We will now show that if inequalities (21) through (23) are fulfilled, 
inequality (42) is satisfied even if X, ¥ 0. 

It is convenient to introduce the quantity 


2 0 o ; 2 
Q = 4R.R — |Z, (| (2: +2) eae 


a product of two factors. This second factor is always positive and the 
first is the expression appearing in inequality (42); it follows that 
inequality (42) is equivalent to the condition Q > 0. Now let us con- 
sider the behavior of Q as a function of X,. Using eqs. (88) through (40), 
it can be verified that 


22-bit (113%) 


From this relation we see that if inequality (21) is fulfilled, then the 
minimum value of Q (as a function of Xg) occurs for X¥, = 0. That is, 
Q is positive for all Xg provided it is positive for X, = 0. Since we 
already know that inequalities (21) through (23) insure Q > O for 
X, = 0, we conclude that they also insure Q > 0 for all X, . Thus, if 
inequalities (21) through (23) are fulfilled, then ® > 0, Q > O and 
condition (85) is fulfilled. 


2.4 Lossless Interconnection of n Nonlinear Networks 


In this section certain properties of a lossless interconnection of 
stable nonlinear networks are discussed; these illustrate the significance 
of the theorem of the preceding section. 

Consider n networks N, , N.--- , N, of the same type as the network 
of rig. 6. Let them be connected as shown in Fig. 7, through a (n + 1)- 
terminal-pair linear time-invariant lossless network L, resulting in a 
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Fig. 7—Lossless interconnection of n networks Ny, --- , Nu. 


one-terminal-pair network N. Let N be driven by the sinusoidal current 
a(t) of eq. (1) and assume that 7(¢) produces in N a periodic steady-state 
with frequency w, . Assume that M,, M2, --- , M, are unconditionally 
stable for this steady-state. Then the small-signal terminal behavior 
of N satisfies condition (35) (i.e., N is passive at w, + p), no matter what 
the values of the passive impedances Zo:(w), Zoo(w), «+: , Zon(w) for 
w = p may be. This is a direct consequence of the theorem of the 
preceding: section, which shows that if we superimpose on z(t) the 
perturbation 67(¢) of eqs. (9), then the total power absorbed at w; + p 
by N, (r = 1, 2, --- , n) is necessarily positive. Thus, the power absorbed 
by N (the sum of the powers absorbed by N,, ---, N,, because L is 
lossless) is positive. 

Note that this result implies that when M,, --- , M, are uncondi- 
tionally stable, then the (n + 1)-terminal-pair network C consisting 
of M,,--- , M, interconnected through L (Fig. 7) is also unconditionally 
stable. Thus, a lossless interconnection (of the type represented by 
the network C) of unconditionally stable networks M,, ---, M, is 
unconditionally stable. 

Another consequence of the above result is that the impedance Z 
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presented by N at w, must satisfy inequality (18) no matter what the 
values of the positive de resistances Ro, = Zoi(0), Roo = Zo2(0), --- , 
Ron = Zo,(0) may be. In fact, in the following section we will see that if 
inequality (18) were violated for some choice of Ro:, Roz, -°: , Ron 
then the small-signal terminal behavior of N at w; + p would be poten- 
tially unstable, which cannot be, because we have already seen that C 
is unconditionally stable. 


2.5 Concept of a Stable Nonlinear Impedance 


Definition: The nonlinear impedance Z presented at w, by a one- 
terminal-pair nonlinear network N which can exchange power only in 
the vicinity of w, and does not contain time-varying sources of energy 
(such as the network N shown in Fig. 2a or the network N of Fig. 7) 
is said to be stable if (and only if) it satisfies inequality (18) and R > 0. 

An important property of a stable, nonlinear impedance has already 
been pointed out in Section 2.3, where it was shown that such an 
impedance cannot give rise to the jump phenomenon. We now want to 
point out another property of this impedance in connection with the 
small-signal terminal-behavior of N at w, + p. 

If the frequency p is so small that the value of Z,(w) (Fig. 6) for 
w = pcan be assumed equal to its value for w = 0, Zs = R, , then the 
small-signal terminal behavior of the network N of Fig. 6 at wo, + p 
is uniquely specified by Z and the derivative of Z with respect to | I | . 
In fact, if in eq. (88) we set Z, = R,, then ~ can be identified as the 
derivative of I, with respect to | I | [see eq. (32)], and therefore according 
to eq. (87) the small-signal terminal behavior at w, +: p can be expressed 
in the form 





v.) Z+4ltlao7 aaa i, 

= ; (43) 
dZ* dZ* 

ve) | ai fe wea] on 


provided J = |I|. This equation? is also applicable to the network 
of Fig. 7, in which case p must be sufficiently small such that Z,,(p) = 
R,, (r = 1, --- , n). Now, one can easily verify using this equation that 
the conditions necessary and sufficient for passivity are identical to the 


t Note that this equation can be obtained by the same method used in Section 2.2 
to derive eq. (11). In fact, the matrix of eq. (43) can be formed directly from the 
matrix of eq. (11) by deleting from this matrix the second row and second column and 
then replacing 0Z/d|I| with dZ/d|I\ throughout the resulting 2 X 2 matrix. 


STABILITY OF MIXERS 2155 


conditions necessary and sufficient for unconditional stability, and are 
given by inequality (18) and R > 0. Thus, we can say that a stable 
nonlinear impedance insures passive behavior against small perturba- 
tions at frequencies very close to w, (this property is in accord with 
the theorem of the preceding section). 

An interesting consequence of these results is now discussed in 
connection with the circuit of Fig. 7, which has been redrawn schemati- 
cally in Fig. 8. Consider the small-signal terminal behavior of this 
network about some given steady-state condition and assume that the 
impedances Z,, ---, Z, presented at w, by the nonlinear networks 
N,, --:, N, are stable. Let p be sufficiently small so that eq. (48) is 
applicable to each nonlinear network JN, (ie., let Z,,(p) = #.,). Then 
each nonlinear network is passive at w,; + p and therefore N is also 
passive at w, + p; this implies that the impedance resulting from a 
lossless interconnection (of the type shown in Fig. 8) of stable nonlinear 
impedances is a stable impedance.' In particular, if two stable non- 
linear impedances are connected in series, or in parallel, the resulting 
nonlinear impedance is stable. 

This last result has an important application in connection with 
harmonic generators. Often such nonlinear networks are driven by 
pumps that are not linear and that can be represented by an equivalent 
circuit consisting of a nonlinear impedance Z, in series with an ideal 
voltage e(¢). The result in question shows that, even in the case of a 
harmonic generator driven by such a pump, the jump phenomenon can 
be prevented by designing the pump and harmonic generator so that 
both of their impedances (Z, and Z) satisfy inequality (18). A particular 





Fig. 8—Network N. 


+ Such an interconnection will therefore satisfy requirement (24). 
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case discussed in the following section will show that a harmonic 
generator can actually be designed to satisfy inequality (18) for all 
magnitudes of its input current I. 


Ill. APPLICATIONS 


Two applications are now discussed, but first we summarize some 
results of a previous study’, concerning stability and noise in a Schottky 
barrier down-converter. That study motivated the present theory; 
conversely the present theory was needed in that study. In Section 3.2 
results of a study of the jump phenomenon (following certain experi- 
mental work on solid-state power sources’) are given; other related 
effects (e.g. starting problems) are also discussed. 


3.1 Schottky Barrier Down-Converter’ 


Figure 9 shows schematically a network consisting of a Schottky 
barrier diode connected to two filters F, and Fy , which permit currents 
to flow only in narrowbands centered about w = w, and w = 0 respec- 
tively, and which have zero impedance at those frequencies. The diode 
is represented (to good approximation) by the equivalent circuit of 
Fig. 9b, consisting of a small resistance R, and two nonlinear elements, 
the barrier capacitance C,(v,) and the barrier resistance R,(v,). C,(v,) 
and the current 7, through R,(v,) are assumed to obey the familiar 
relationships 


Cinin Vb — Vz 
Vd — Vv, 


in = i exp (2) = | (45) 





CL@,) = (44) 


v(t} 





(a) 


Fig. 9—Down-converter consisting of a Schottky barrier diode and two filters 
F, and F. 


STABILITY OF MIXERS 2157 


where V;, is the breakdown voltage of the diode and @ the contact 
potential. 

This network is of the same type as the network M considered in the 
preceding sections. Therefore, its stability can be studied using in- 
equalities (21) and (22) (for this network, inequalities (23) are always 
fulfilled). Its impedance Z at w, can be written 


Z=Rh, +k, + 7X2, (46) 


where Z, = R, + 7X, is the impedance presented by the barrier of the 
diode. Because F,; and FI, allow current to flow only in the vicinity of 
® = w, and w = O, the current through the diode cannot have compo- 
nents at the harmonics 2, , 3w, , 4; , etc. (and at their side frequencies 
2w, + p, 380, + p, 4w, + p, ete.). This condition is an important 
requirement for low noise down-conversion. Another important require- 
ment is that the diode should be fully pumped. That is, the current I 
should have the largest magnitude allowed (for a given J,) by the 
breakdown voltage V, of the diode; we assume that this is so. Then, 
if V, is sufficiently large, this circuit has the following properties.’ 

First, for unconditional stability, it is sufficient (and of course neces- 
sary) that inequality (21) be fulfilled [for this circuit, inequality (22) is 
always fulfilled if inequality (21) is fulfilled]. Second, 


ac 


ST] = oaCen’ 


(47) 
where, according to eq. (44), Cin is the value of C,(v,) for v, = Ve. 
Third, the power absorbed by the barrier resistance is very small’, 
so that inequality (21) is violated provided RF, is sufficiently small. 
To find out how small R, should be for the circuit to be potentially 
unstable, one can neglect A, with respect to R, in eq. (46). Then, in- 
equality (21) requires 


1 
2 


ox 


oe sarah 


| I (48) 
From this inequality and eq. (47), we find that high gain is possible 
provided 
aah 2 4, (49) 
We 
where w, is the cutoff frequency of the diode, w, = (R, Cnin) . According 
to this inequality the highest pump frequency for which a given diode 


? For this to be true, #: must be large, such that the diode behaves essentially 
like a variable capacitance for v,» < 0 (see lef. 1). 
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can be made to exhibit arbitrarily high gain is approximately w,/4. 

This simple relation is valid provided V, is very large. If Vz is finite, 
high gain can be obtained only if nw,/w, < 1, where 7 is a parameter 
which is typically less than 6.25 (but always greater than 4; see Ref. 1). 


3.2 Abrupt-Junction Varactor Doubler® 


Experimental varactor multipliers’ exhibit instabilities of the type 
considered in this paper. In practice it is often found that empirical 
techniques are necessary to make a varactor multiplier self-starting. 
Furthermore, the range of frequencies, temperatures and powers over 
which a varactor multiplier shows stable and efficient operation are 
often seriously limited by the jump phenomenon. Little is known about 
the restrictions that should be imposed on multiplier design to prevent 
such undesirable effects. 

In this section, the simplest varactor multiplier, the doubler with 
abrupt-junction varactor, is considered. Our main result is a stability 
diagram which gives, for any given varactor characteristics, the input 
frequencies w, and: load resistances R,, for which discontinuous jumps 
and starting problems may occur. It is shown that these nonlinear 
effects can always be prevented in an abrupt-junction doubler by 
properly choosing the output load. In particular, these effects will not. 
occur if the output load is optimized for maximum efficiency. This 
result, obtained with the help of data from Penfield and Rafuse,” shows 
the importance of optimizing the efficiency of a multiplier in order to 
reduce starting problems, discontinuous jumps and spurious oscillations. 


3.2.1 Assumptions® 


The model of Fig. 9b is assumed for the diode. Since we are interested 
in converting power from pump frequency w, to its second harmonic 
2w, , the diode is assumed to be terminated by a load impedance Z, = 
Ry, + 7X, at 2w, and to be open-circuited at 3, , 4w, , 5w, , ete. Further- 
more, we assume that it is biased at w = 0 by a fixed voltage V,. 

The doubler can then be represented as in Fig. 10. The three networks 
F,, I’, and F, are ideal filters. The impedance of F, (r = 0, 1, 2) is 
assumed to be zero for w & rw, and infinite for | w — rw, | > w,/2 (w > 0). 
We are interested exclusively in the behavior of this circuit in the 
particular case V, = constant.! 


t The behavior for the two cases V. = constant and J, = constant is discussed 
in Ref. 1, for the limiting case Ry = © (some of the results of Ref. 1 have been 
pointed out in Section 2.6). It is shown in Ref. 1 that the condition V, = constant 
yields greater stability than for J, = constant. 


STABILITY OF MIXERS 2159 


i({t) 
_———t © 


2 —_— 2. 





Fig. 10—Doubler. 


Let J be the complex amplitude of the diode current at w,, and 
v,(¢) the voltage across the barrier. The doubler will normally be operated 
about a particular steady state, corresponding to some value I, of J. 
Let v,,(t) denote the voltage across the barrier for this particular steady- 
state, and assume that this steady-state is characterized by the condition 

x, = Sua, (50) 

4w, 

where Sy and S,, denote, respectively, the maximum and minimum 
value of the elastance of the diode for v, = v,,(t) (we make this assump- 
tion because maximum efficiency for a doubler occurs approximately 
when this condition is fulfilled’). Throughout this section we also assume 
that the operation of the diode is restricted to the range of voltages for 
which the barrier capacitance is predominant over the barrier resistance. 
We therefore neglect the barrier resistance (see Fig. 9b) and represent 
the diode simply by a resistance R, in series with a variable elastance 
S(v,) = C™'(v,). 

According to Section 2.5, the stability of the doubler at frequencies 
close to w, depends on the sign of the quantity 


nip = ae PD _ (17) ) (51) 


where R and X are the real and imaginary part of impedance Z pre- 
sented by the diode at w, . If, for some value of | J! , (| Z|) < 0, then 
restrictions must be imposed on the diode terminations at frequencies 
close to w, in order to prevent spurious oscillations at these frequencies, 
for that value of | Z|. Furthermore, restrictions must be imposed on 
the internal impedance Z, of the pump at w, , in order to prevent the 
jump phenomenon for 0 S$ |J| S | J, | . If, on the other hand 


n(L|)o> 0. for Os | 1] <1 tels (52) 
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then the jump phenomenon and the above spurious oscillations will not 
be possible for all values of | Z| in the intervalO $ |7| S |J,|. 


3.2.2 Results 


The functional relationship between Z and |J| has been obtained 
in a straightforward manner using the procedure described in Ref. 6 
(pp. 299-335). For given diode characteristics, the form of this relation- 
ship depends upon the value of R, . The effect of this parameter on the 
stability of Z for S,,/Sm = 0 is shown in Fig. 11. The stable region of 
this diagram gives the values of R, and R, for which condition (52) is 
fulfilled. The boundaries of this region are characterized by the property 
that the minimum value of »(|J|) over the interval 0 S |J| S |J,| is zero. 
It is interesting to note that there are values of R, for which condition 
(52) is fulfilled even if R, = 0. The dashed curve of Fig. 11 is the curve 
given by Penfield and Rafuse® for the load resistances required for 
maximum efficiency at |I| = |J,| . Note that this curve is inside 
the stable region, as pointed out earlier in this section. 

The unstable regions consist of the points for which n(| J |) < 0 for 
some values of |I|,0 S$ |Z| S | J. |. These regions can be divided 
into subregions having different properties, as indicated in Fig. 12. 
In subregions @) and @) n(| J. |) > 0. In @ and ® n(| I |) < 0 even if 


— 
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Fig. 11—Stability diagram of the abrupt-junction varactor doubler for Sm/Sm 2 0. 
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Fig. 12—Details of the diagram of Fig. 11. 
| Z| = |Z, |. For any point in one of these four subregions, the value of 


I always determines uniquely the voltage v,(t) across the barrier of 
the diode. For any point in ©), on the other hand, there are values of | J | 
for which »,(¢) is not uniquely determined by J, as illustrated by the 
example in Fig. 13, where V,, denotes the minimum value of v,,(d). 
(Thus, Sq = C'(V,,)). To prevent such undesirable behavior it is 
necessary (and sufficient) that 


Ry, +R, > —&, (53) 
1 
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Fig. Pe of a |V| — |Z| characteristic in subregion ® (Ry = 1.14225 
107? Salas R, = 0). 


where é = 0.0166. Note that in the example of Fig. 18 the desired 
condition v(t) = v,-(f) cannot be obtained by simply increasing | I | 
very slowly from zero to | I, | . 

For points in regions @), @), @) and @ in Fig. 12, constraints must be 
placed upon the pump impedances Z, in order to avoid discontinuous 
jumps and starting problems. For instance, consider the case R, = 0 
and Ry, = 0.3565 Sy/w, . One can see from Figs. 11 and 12 that such a 
multiplier is potentially unstable, since it corresponds to a point located 
in subregion @). The variation of the input voltage with current is shown 
in Fig. 14. It can be shown that if one connects in series to the input of 
this multiplier, an inductance (having reactance jX,) chosen to tune 
jX for| I | = |Z. |, the voltage FE across Z + 7X, will exhibit the behav- 
ior given in Fig. 15 by the curve corresponding to z, = 0.5. Figure 15 
also shows two examples of the behavior arising for X¥, < Sm/2a 
(it can be shown that Sy/2w, is the value of X, needed to tune X for 
|Z | = |Z, |). All the characteristics of Fig. 15 exhibit a negative dif- 
ferential resistance over part of the range 0 S |J| S | J, | . Further- 
more, in each case there is a range of voltages for which more than one 
value of | Z| is possible for a given value of | / | . In all cases the range 
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Fig. 14—Example of a |V| — |J/| characteristic in subregion @ (Ry = 0.3565 
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in question contains the voltage for which |J| = | J. | . The dotted 
curves of Fig. 15 show the effect of a small series resistance R, ; they 
have been calculated for w,/w, 
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Fig. 15—|E| — |/| characteristic corresponding to the example of Fig. 14 (R, = 0). 
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3.2.3 Differences between this Analysis and that of Refs. 8-10 


For any point in the unstable region of Fig. 11 restrictions must be 
placed on the diode terminations at w, + p (small p) in order to prevent 
the appearance (for some value of | J | in the interval O S | I| S | J, |) 
of spurious oscillations at w, + p. The mechanism responsible for such 
spurious oscillations differs from that discussed in Refs. 8-10. The 
spurious oscillations considered in Refs. 8-10 can, in general, be pre- 
vented by imposing suitable restrictions on the diode terminations at 
2w, + p. In particular, they cannot occur if p is low enough so that the 
terminations at 2w, + p are essentially equal to Z, . On the other hand, 
we have just shown that spurious oscillations are possible even if this 
condition at 2w, + p is fulfilled. This discrepancy between the results 
of the two analyses arises because the analysis of Refs. 8-10 is not 
applicable to the circuit of Fig. 10, since in this circuit the diode is 
short-circuited for w = p, whereas in Refs. 8-10 the diode was assumed 
to be open-circuited at w = p.? Furthermore, in Refs. 8-10 the output 
load was assumed to be tuned, whereas in our analysis consideration has 
not been restricted to the particular pump level for which this condition 
is verified. 


3.3 Concluding Remarks 


The jump phenomenon is a form of instability. Thus it should not be 
surprising that a mixer capable of producing this nonlinear effect is 
potentially unstable, and vice versa. The derivation of inequalities 
(21) through (23), which are the stability conditions necessary and 
sufficient for the stability of a mixer, has been organized to demonstrate 
the important relationship between the jump phenomenon and mixer 
stability. A knowledge of this relation is requisite for an understanding 
of the mechanism of instability in a mixer; it is useful in experiments 
whenever one wants to determine whether or not a given mixer 1s 
potentially unstable. For that purpose the simplest procedure is to 
connect the mixer to a pump and a de bias supply (as shown in Fig. 3) 
and then determine (in the two cases R, = 0 and Rk, = &) whether, 
by varying | #| and Z,, the circuit can be made to exhibit the jump 
phenomenon. This procedure is straightforward and has been used 
extensively in experimental work on down-converters.' 


* More precisely, in Refs. 8-10 the impedance Z terminating C(v,) at o = p is 
assumed to be sufficiently large so as to insure negligible charge fluctuations at 
w = pin C(v). However, this condition cannot by realized in the limiting case p — 0 
because it can be shown that for p — 0 this condition requires that 

lim,» pZp Pe, 
This requirement. is unrealizable. 
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In Sections 2.6 and 2.7, two applications of practical interest (down- 
converter and doubler) have been considered; they are quite different 
in many respects. A fully-pumped down-converter is, in general, poten- 
tially unstable if R, is sufficiently small, whereas a doubler can be 
unconditionally stable (even when R, = 0), if it is properly designed. 
Furthermore, in the case of a down-converter, potential instability may 
be a desirable feature whereas it is highly undesirable in the case of a 
doubler. Another difference between the two cases is that in the down- 
converter of Fig. 9, the jump phenomenon always appears to be pre- 
vented by proper choice of R, and Z, , while in the case of a doubler, 
the behavior of Fig. 13 may arise when the doubler is improperly 
designed, in which case the doubler is unusable for all practical purposes. 
However, in spite of these differences (which arise in part because the 
two circuits of Figs. 9 and 10 are intended for different purposes), the 
two cases are related, for in the limit Ry; — © the circuit of Fig. 10 
becomes that of Fig. 9. 

We conclude by summarizing the derivation of inequalities (21) and 
(22). A nonlinear impedance Z (with R > 0) obeying inequality (18) 
has the following property: if an arbitrary passive impedance Z, is 
connected in series with Z (Fig. 3), and H' denotes the voltage [(Z + Z;,) 
across Z + Z,, then necessarily d | £|/d |I| > 0. In Section 2.5 
such an impedance has been termed stable. 

We derived inequalities (21) through (23) by connecting the network 
M to a de voltage supply, and by requiring that the resulting impedance 
Z be stable for all nonnegative internal resistances R, of the de voltage 
supply. This procedure is analogous to that used in ordinary linear 
time-invariant network theory for deriving the stability conditions of 
a two-terminal-pair network. In fact, the stability conditions of such 
networks are usually derived by connecting one of its two terminal 
pairs to an arbitrary passive impedance, and then requiring that the 
resulting impedance at the other terminal pair be stable (.e., that its 
real part be positive). 

In Section 2.3 it was shown that if 0U,/o7, > 0, then Z has the follow- 
ing property: if inequality (18) is fulfilled in the two particular cases, 
Rk, = 0 and Rk, = ~, then it is also fulfilled for all positive R, . Thus 
it was concluded that if Z is to be stable for all nonnegative R, , it is 
necessary and sufficient that d0,/d7, > 0, R > 0, and that the two 
inequalities (21) and (22) [which are inequality (18) for R, = © and 
FR, = 0, respectively] be fulfilled. Then, in Section 2.3, we have proven 
a theorem showing (as a corollary) that these two inequalities, and 
the inequalities 00,/al, > 0 and R > 0, are necessary and sufficient 
conditions for the stability of M. 
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APPENDIX 


Let us superimpose on /, and | F| in Fig. 2 small perturbations 
5£, and 6| £ |, and let 6| Z| and éJ, denote the resulting perturbations 
of | J | and I, . We can write 














d|E| o|# 
sie) (eB! SLED) oir 
aE, a, 
SE, aT 2 of. sl, 
where 
E = I[Z, og add, ) | I \)] (55) 
E,=0,70,,|f |) + BJ. . 
Two particular cases 6H, = 0 and 6 | | = 0 are of interest. In the 
former case, from eq. (54), 
5|E ) me) 
(5 [T Veewo * 3B.” 008) 


al, 
where J is the determinant (Jacobian) of eqs. (54), 


america aa al, ali | (67) 


In the latter case, from eq. (54), 


6H, J 
(BF) 3|£| re) 
a|I| 

Equation (56) gives the derivative of | # | with respect to |Z | when 
E, is held constant. We have already considered this derivative in 
Section 2.3, where it was shown that inequalities (21) through (23) 
are necessary and sufficient for this derivative to be positive for all 
R, = 0, R, = Oand X, . We now show that inequalities (21) through (23) 
can also be interpreted as the necessary and sufficient conditions for 


sie 
—° 0. 59 
(a 31 E1=0 a oy 


We note from eqs. (56) and (58) that 








STABILITY OF MIXERS 2167 








(i121) al# | clelé 
(2) = 6|I| ao chp die as iz a iT (60) 

















61, ab, 
ol, ar 
where d | £|/d | I'| is the derivative discussed in Sections 2.2 and 2.3. 
If inequalities (21) through (23) are fulfilled, then certainly 
d|£ | 0|£ | dE, 
Ee S rn son en 


(note that requirement (17) implicitly demands 0 | £|/a |I| > 0, 
because for R, = ~, d |f|/d |I| reduces to d | Z|/o|I|). Thus, 
inequalities (21) through (23) are certainly sufficient conditions for 
requirement (59) to be fulfilled; they are also necessary because if 
requirement (59) is violated, then, according to eq. (59), at least one of 
inequalities (61) is violated for some R, = 0, R, = O and X, , and 
we already know from Section 2.3 that in such case, inequalities (21) 
through (28) are violated. 

We have just shown that requirements (17) and (59) are equivalent. 
It can be shown, in an analogous way, that an equivalent requirement is 


J>0O forall R, 20, R20, X (62) 


[other equivalent requirements may be obtained by replacing > with ¥ 
in (17), (59) and (61)]. 
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Performance and Stability of Schottky 
Barrier Mixers 


By C. DRAGONE 
(Manuscript received June 21, 1972) 


We discuss the performance of a Schottky barrier diode as a mixer when 
the barrver of the diode 1s open-circuited at the harmonics 2w, , 3w, , etc. 
of the pump frequency w, . Such a mixer ts shown to be capable of arbitrarily 
high conversion gain provided 


W. = NW 4 


where w, 1s the cutoff frequency of the diode and yn is a parameter that is 
typically less than 6.25 and approaches 4 under certain ideal conditions. 
It 1s shown that the limit imposed by the series resistance of the diode on the 
double-sideband noise figure of the mixer ts given by 


-1 
F,, > (1 — »*) . 
We 


An experiment ts described at 1.25 GHz on a room temperature mixer 
whose double-sideband noise figure F,, as a function of gain has a minimum 
of about 0.7 dB (for gain less than unity) and a maximum of about 2.3 dB 
(for high gain). 


I. INTRODUCTION 


In the past five years the performance of microwave mixers has been 
substantially improved with the advent of high quality Schottky barrier 
diodes." The noise figures obtained so far are better by a factor of 
approximately 2 than those obtained previously using point-contact 
diodes.""* The ultimate microwave noise figure obtainable with these 
devices is not yet known, but there is reason to believe that, at room 
temperature, a figure well under 3 dB is possible. 

Calculation in a previous article’ showed that a Schottky barrier 
diode with suitable characteristics should have a noise figure well under 
1 dB provided the barrier of the diode is open-circuited at the harmonics 
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2w,, 3w,, etc. of the pump frequency w,. However, that calculation 
neglects the barrier capacitance and is therefore valid only at low 
frequency. The present purpose is to investigate the effect of the barrier 
capacitance. Three main assumptions are made: (2) the barrier of the 
diode is open-circuited at 2w, , 3, , etc, (iz) the output frequency p of 
the mixer is very low with respect to w, , and (277) the diode can be repre- 
sented by an equivalent circuit discussed in Section II. 

A mixer is commonly considered to be a linear transducer having 
finite maximum gain and a noise temperature ratio close to unity; the 
maximum gain is usually considered its most important attribute.’ 
However, this picture is not valid in general for the mixer under con- 
sideration here. It will be shown that this mixer is potentially unstable 
if the cutoff frequency w, of the diode is sufficiently high with respect 
to w,. Thus its gain is unlimited, in the sense that it can be made 
arbitrarily high by appropriately choosing the terminations at the 
input, image, and output frequencies (i.e., at w, + p and p). 

In a previous article® the mechanism responsible for instability in a 
mixer was discussed and necessary and sufficient conditions for uncon- 
ditional stability were derived. These conditions, given in Section II, 
are used to determine the relation between mixer stability and mixer 
parameters. The main result is that a mixer is potentially unstable 
(i.e., that high gain is possible) if and only if 


W, 2 WN, (1) 


where 7 is a parameter the value of which depends primarily on the 
breakdown voltage V, of the diode. It is shown that » ~ 4as Vz > © 
and that typically 


4< < 6.25. (2) 


The value 7 has important significance in connection with the noise 
performance of a mixer at high gain because the limit imposed by the 
series resistance of the diode on the ultimate noise performance is given 
by the inequality 


~1 
P. > (1 — 9%) ’ (3) 
We 


where F,,, is the double-sideband noise figure. 

Following the analysis of Ref. 5, an experiment was undertaken to 
determine the performance obtainable from a mixer satisfying assump- 
tions (2), (77), and (727). We designed such a mixer and measured its 
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behavior as discussed in Section VI. It was, as expected, potentially 
unstable. The double-sideband noise figure as a function of gain was 
found to have a minimum value of 0.7 dB, occurring at a gain less than 
unity, and a maximum of about 2.3 dB, at high gain. 

High gain in a mixer is no new phenomenon; it was demonstrated 
both theoretically and experimentally more than 20 years ago.’ Since 
then, the effect of the barrier capacitance has been treated by several 
authors.” ’” However, to the best of our knowledge, the effect of the 
barrier capacitance in a mixer satisfying assumption (7) has never been 
studied before. The amplifying ability is not a surprising property 
(Ref. 6), but good noise performance at high gain is perhaps unexpected. 


II, PRELIMINARY CONSIDERATIONS 


The equivalent circuit of Fig. 1 is assumed for the Schottky barrier 
diode; it consists of a small series resistance R, and two nonlinear 
elements, the barrier capacitance C(v,) and the barrier resistance R(v,). 
The capacitance C'(v,) and the current 7, through R(v,) are assumed to 
obey the familiar relations 


C.Vo 
re (4) 


in = tg | exp (2) = | (5) 


where ¢ is the contact potential, 7s the saturation current, g the electronic 
charge, k the Boltzman constant, and 7 the absolute temperature; 
q/kT = 40 for T = 290°K. 





Ciy,) = 


and 





Fig. 1—Schottky barrier diode. 
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Figure 2 shows a two-terminal-pair network M driven by a sinusoidal 
current 


a(t) = 2I cos wt (6) 


and a direct current J,. This network consists of the diode and two 
filters F, and I’,, . According to assumption (22), we assume for F, and 
I, the following characteristics at the harmonics of w, (and in their 
vicinity): F, is a short circuit at de and an open circuit at w,, 2w;, 
3w, , ete.; F,, is a short circuit at w, and an open circuit at de, 2w, , 3, , 
etc. From Fig. 2 the terminal current of the diode is 


a(t) = I, + i). (7) 
The voltage v, across the barrier is assumed periodic with frequency w, , 
v,(t) = % + 2 Re (Vee ne (8) 


where the dots indicate components at 2w, , 3w, , etc. Let Z, denote the 
impedance presented by the barrier at w, . 


Vor, 


Z, = R, “- jXy = T (9) 
Then the impedance Z presented by the network at w, is 
The terminal voltage at de is 
V,=V.,+ sl, . . (11) 


When this network (designated here by M; see Fig. 2) is used as a 





[ NN | 


i(t)=2 TCOSwot 
—» 


Z=Rt+jX 





Fig. 2—Network M representing the mixer. 
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mixer, a small signal is applied at the input frequency w, + p (or w, — p) 
and suitable terminations are provided at the image and output fre- 
quencies w, — p (or w, + p) and p. Signal power then flows out of the 
network at w = p to the output termination. The conversion gain, 
which is defined* here as the ratio of the output power to the power 
available from the input signal generator, has a finite maximum value 
only if M is unconditionally stable. If M is noé unconditionally stable 
(i.e., if M is potentially unstable), its gain can be made arbitrarily high 
by properly choosing the terminations at w, + p and p. To determine 
the conditions for which M is potentially unstable, we assume p X w, 
[assumption (22)]; this allows us to use the stability criteria of Ref. 6, 
which are discussed in the following part of this section. Since applica- 
tion of these criteria does not require a knowledge of the conversion 
properties of M at w, + p and p, the analysis will be concerned exclu- 
sively with the behavior of M at w, and de. 


2.1 Stability Criteria’ 


Let a one-terminal-pair network be constructed by connecting M to 
a de source as shown in Fig. 3. The nonlinear impedance Z characterizing 
the terminal behavior at w, of this network is a function of the amplitude 
I of z(t). The form of this function depends upon the characteristics 
of the de source. Of particular interest are the two cases arising when 
the de source is: (2) an ideal current source with infinite internal im- 
pedance, or (77) an ideal voltage source with zero internal impedance. 
It has been shown in Ref. 6 that a necessary and sufficient condition 
for the network M to be eo unstable is 


in = (188) 0 


in one of the above two cases. 

The network M imposes a set of nonlinear relations between its 
terminal voltages and currents at de and w, . Because of these relations, 
the impedance Z and the de voltage V, can be regarded as functions 
of J, and J, 


R=R7T,,D, X= X,,2) 
Ve = VAL. , 2). (13) 


* There are several ways of defining the gain of a mixer. A different definition will 
be used [see eq. (51)] in connection with the experiment described in Section VI, 
where the input signal generator will contain both w. + p and w. — p. However, 
the particular definition is immaterial to the analysis. 

t For the network of Fig. 2 one can show that R > 0 and 0V,/dl, > 0; if these 
aden are not satisfied, the network would obviously be potentially un- 
stable. 
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DC 
SUPPLY 


Fig. 3—Mixer connected to a dc bias supply. 


These three functions completely describe the terminal behavior of M. 
If these functions are known, the derivatives appearing in inequality (12) 
can be evaluated as follows. 

Note first that the two cases mentioned above correspond to the 
two conditions 


I, = constant, (14) 
V, = constant. (15) 

Thus, in the former case, inequality (12) takes the form* 
sp 2 < (1 oxy (16) 


involving only partial derivatives of IRU,, I) and X(I,, I) with 
respect to J. 
In the latter case, a eq. (15) we obtain 


OY. 
al, 








= 0. 


Thus, 








dl, _ av, (242) 
dl ~-—sa@l’:« ‘dz, 


Using this relation and the rule 
| ae | ne 


a” at ok. 81? 





we obtain for inequality (12) 


r > 2 
av, B¢ a(R) _ aV, oUF) = | aX oV,  , aX oY 17) 


al, Lal, ol al al, ar Ol. al, al 


Inequalities (16) and (17) will be useful in Section IV. 


4k 





* Throughout this paper 0/dJ, (or 0/01) is used to indicate differentiation with I 
(or J.) held constant. 
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In the following section we consider the behavior of M in a limiting 
case and find that a simple relation exists between Z and V, , J. Because 
of that relation, it is possible to ascertain stability directly from in- 
equality (12), rather than using (16) and (17). 


III. ANALYSIS OF A LIMITING CASE 


Analysis of the circuit of Tig. 2 is a considerable task, primarily 
because the current 7, through C'(v,) is not in general simply related to 
the total current 7, through the barrier. However, if the frequency w, 
is sufficiently high, the current zz absorbed by R(v,) is much smaller 
than 7, for all ¢, and 7, is approximately equal to the alternating com- 
ponent of 4, . 


i,(t) & 2T cos w,f. (18) 
In the present section we study this case. Our main result is inequality 
(37). 
If g and S denote the charge and elastance of the barrier capacitance 
respectively, then, since S = dv,/dq and 2, = dq/dt, we can write 


dS  dSdv,dq_ 1d{S?) . 


From eq. (4) and the fact that S = C7*(v,), 
ee (20) 


~ C.V 6)" 
Taking the derivative and substituting in eq. (19) results in 
dS 1 
a 5ever 
From this equation the alternative component of the elastance S(t) 


produced by the current 7,(¢) of eq. (18) is determined. If S, denotes 
the average value of S(¢), using eqs. (18) and (21), 


(21) 


—_ 1 yl TWot i=) yl ee) ‘ 
a ae TIE |e Dov. © ee) 
Substituting this equation in eq. (20) results in 
= ’ 
v t aed — C V/. 2S. SSS oe 


+ (-pSt +5 Stee" +), 


Wo 
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where the dots indicate components at +2w,. The amplitudes of 
v,(t) at de and w, , obtained from (23), are 


I 
V.262 6 VS > 24 
b @ ( ¢) (C, VJ) 723° ( ) 
and 
. Sot 
V=-j a (25) 


Thus, the reactance X presented by the barrier capacitance at w, can 
be written 








eames (26) 
Wo 
From eq. (24) 
en ae | ee eae eee de (27) 
Ove NE OC, VE)208 


Equations (26) and (27) specify the reactance X in terms of the two inde- 
pendent variables J and V,, . 

We now make the assumption that the diode is so operated that the 
power* absorbed by R(v,) at w, is much smaller than the power dissipated 
in Rs at w,. Because of this assumption, which is consistent with eq. 
(18), the impedance Z presented by the diode at w, is simply Rs + 7X. 
According to the preceding section the stability of the network of Fig. 2 
can be ascertained from the behavior of Rs + 7X using inequality (12), 
which reduces to 





aT (28) 


because Fs is independent of J. In order for the network of Fig. 2 to be 
potentially unstable, inequality (28) must be fulfilled under at least 
one of the two conditions (14) and (15). 

Consider first condition (15), in which case V,, can be assumed 
independent of J (V,, = V, because RI, = 0). From eqs. (26) and (27) 
one obtains 


aX _ sth 
dl 2(C, V$)'wS, 


* This power is (2(Re)V yet pave; it can be calculated to a first approximation 
using eqs. (5), (23), and (27). 


(29) 
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Let Sj, and S,, denote the maximum and minimum value of S(é). 
From eq. (22) one can verify that J and S, are related to S,, and S,, 
as follows: 


Soe ee + es (30) 


I = 0,(C, Vo)’ a a Au Es. (31) 


II 


These relations and eq. (29) yield 


dX _ 1 Su (1 — Sp/Su) 
Oar due ee SlSu) en) 
which is valid provided V,, is independent of J. 
Now consider condition (14) where V,, is a function of J. In general, 
no simple relation exists between X and I; however, in Appendix A it is 
shown that if for a given value of the ratio S,,/S,;, the inequality 


Wm 
kT 


obtains [v,, 1s the minimum value of v,(¢)], then condition (14) becomes 
equivalent to 








> 1 (33) 


S, = constant (34) 


which leads to a simple relation between X and J. In fact, using eqs. 
(26), (80), and (31), X can be expressed in the form 


r= Wi [s+ See : - 


and X is linearly related to J. Further, using eqs. (31) and (85), 


dx Sua — Sn 

I ras aoe. aie : (36) 
We now compare the two cases J, = constant and V, = constant, 
under the assumption that in the former case condition (83) is satisfied. 
From eqs. (32) and (86), for given S,, , S, , and w, , eq. (86) gives a 
larger magnitude for JdX/dI than eq. (82). Since eq. (36) corresponds 
to the condition J, = constant, we conclude that the network of Fig. 2 
is potentially unstable only if inequality (28) is fulfilled in that case. 

From inequality (28) and eq. (36), 


Su — Sm 


<= 
fis 8 4w, : 


(37) 
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which gives the values of Rs, Ss, S,», and w, for which instability 
(and therefore high gain) is possible. If w, denotes the cutoff frequency 
of the diode, so that w, = Sy,/Rs , andif S, <«< S,, , then inequality (37) 
reduces to 


wo, 2 4w, (38) 


which is eq. (1) for » = 4. 

An understanding of the practical validity of inequality (87) is 
obtained by examining the restrictions in the above analysis. It has 
been assumed that the voltage across the barrier can be determined to 
a first approximation by neglecting the barrier resistance and also 
that the power absorbed at w, by the barrier resistance is negligible 
compared with that dissipated in Rs . These assumptions are certainly 
satisfied if operation of the diode is restricted to a range of voltages uv, 
for which the barrier capacitance is predominant over the barrier 
resistance. However, this restriction is impractical because it would 
result in a mixer with very poor performance; for optimum performance 
the diode should be fully pumped; that is, v,(t) should vary over the 
entire usable range of forward and reverse voltages. In the following 
section it is shown that inequality (37) is valid, approximately, even 
if the diode is fully pumped (in which case the restriction in question 
is not satisfied), provided the breakdown voltage Vz 2s sufficiently large. 
Thus, we can say that this requirement on Vy [which is in accord with- 
the fact that inequality (87) has been derived under requirement (33)] 
is the main restriction on inequality (37). 


IV. GENERAL CASE 


According to eqs. (4) and (5), the voltage and current at the barrier 
are related through the nonlinear differential equation 


C, Vo dv, (oe = 1, _— 1, = 0, (39) 
V¢ — Vp dt 
with 7, given by eqs. (6) and (7). This equation cannot in general be 
solved exactly, but an approximate solution can be obtained fairly 
simply to any degree of accuracy by the Euler method, as shown in 
Appendix B. Using that method, Z, , V,, , and their partial derivatives 
with respect to J, and J [these derivatives are needed to test the stability 
of M using inequalities (16) and (17)] have been calculated for various 
diode characteristics and terminal currents 7, and J. Table I shows 
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TABLE I—Four EXAMPLES OF THE BEHAVIOR OF MI 























Zse% Ik 7) ~- Ee + OF 
CoueJE 31.25 Coan/b 0.1 
[Z| /Cows Vb 0.75 1.00 1225 1.75 
Von — —0.722 | —1.116 | —1.603 | —2.857 
Vin — —1.648 | —2.660 | —3.919 | —7.180 
Va —¢ —0.093 | —0.089 | —0.086 | —0.082 
Ry woCoV 0.110 0.101 0.096 0.090 
Xp: wl Vb —0.725 | —0.904 | —1.080 | —1.432 
OV 50/81 9+ woo VO 2.016 257 3.133 4.243 
IV 50/0|I|- wo o/h —1.389 | —1.762 | —2.134 | —2.878 
O(|I|Ry)/d1o* woCoVb 0.576 0.774 0.966 1.345 
A(\I|Rs)/A|I| + aoCoVb 0.0781} 0.0763} 0.0757| 0.075 
T9Xp/0Lo* woo Vb 1.403 1.784 2.166 2.937 
\Z]AXp/d|L| + woCoVb —0.542 | —0.710 | —0.881 | —1.228 
four examples where 
+ (ab/kT) * 
1 I, 
sas, =e eo, (40) 
Cw. Cw.Vo 


The four examples correspond to various values of the quantity 
I/C,w, V ¢, which represents the terminal current at w,. The values 
Vm and vy in Table I are the minimum and maximum value of »,(¢). 
Note that according to eq. (20) S,, and S,, are related to vy, and »,, 
through the relations 





V¢ — Um Vo =e, 
Sin — So ) Sau => Se 4] 
C.V¢ C.V¢ a 


One can verify that, in all the four cases of Table I, inequality (16) 
is violated for Rs = 0. Thus, in each case the circuit of Fig. 2 is poten- 
tially unstable provided the series resistance Rg is sufficiently small. 

The values of #s associated with potential instability can be derived 
as follows. In all the cases considered it has been found that if inequality 
(17) is fulfilled, then inequality (16) is also fulfilled; in other words, if Z 
satisfies inequality (12) under the condition V, = constant, then it 
also satisfies Inequality (12) under the condition J, = constant. This 
property has already been found to be true in the limiting case dis- 
cussed in the preceding section. It follows that for the network to be 
potentially unstable it is necessary (and, of course, sufficient) that 
Z= Rs + R, + 7X, satisfy inequality (16). That is, it is necessary 
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that the expression 


A(Rs + p)| (2222) ie Rs | z= (2%) (42) 


be nonpositive. This requirement is equivalent to 


Rs = Rsec ? (43) 


= 3 [oY + RY i) 4+ (2) - 2) _ p, | (44) 


In fact, one can verify that expression (42) vanishes for Rs = Rg¢ and 
is negative for Rs = Rgc. According to inequality (48), Rsc is the 
largest series resistance for which the network of Fig. 2 is potentially 
unstable. 

Figure 4 shows several curves of Rs«¢ versus | v, — ¢ | calculated [using 
Iq. (44)] for different values of 1,/C0,WV and for 7,[exp (q¢/kT)]/ 
w,0,V @ = 31.25. The four points indicated on the curve relative to 
I,/C,w, Vd = 0.1 correspond to the four cases of Table I, as one can 


where 





0.7 
0.6 Cor? 
0.5 
0.4 
0.3 


0.2 








0 1 2 3 —4 —-5 -6 —7 ~—8 -9 —10 
Vin —«(p 


Fig. 4—Behavior of Rgc as a function of v», for different values of Jo. 
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verify using eq. (44). We see from Fig. 4 that for given diode charac- 
teristics and for a given w,, the behavior of Rs¢ as a function of J, and 
|v, — @ | has the following characteristics. For a given I, , Rgc¢ increases 
monotonically with | v, — | . Since v,, cannot exceed the breakdown 
voltage, Vz , the largest value of Rs¢ for a given J, occurs when v,, = Vz. 
For a given v,, Bsc increases with decreasing J, and approaches a 
finite limit for J, > 0. Figure 4 shows that, if | V,,— | > 2 volts, Bsc is 
little affected by the value of I, for I,/w,C,V@ < 0.1, approximately. 

Now from the discussion of the limiting case of Section III, we know 
that, if conditions (18) and (33) are satisfied, then according to inequality 
(37) 


R sco a} 
Sees Se 


Condition (18) can be assumed to be fulfilled when J,/w,C, V/ ob is 
sufficiently small, in which case the quantity Rscw,/(Sy — Sm) is ex- 
pected to approach 1/4 for large | v,, — ¢ |. It is interesting to compare 
this asymptotic behavior of Rsgcw,/(Sa — Sm) with the behavior 
corresponding to the curves of Fig. 4. Figure 5 shows Rgcw,/(Sy — Sm) 
plotted as a function of |v, — ¢| for the four cases corresponding to 


(45) 


0.30 


0.25 


0.20 


Roc Wo 
Su — Sy 
oO 
a 


0.05 
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Fig. 5—Behavior of Rgcw./(Sap — Sm). 
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the curves of Fig. 4. One sees that Rscw,/(Si — Sm) never exceeds 1/4, 
and that if I,/w,C, V¢ is sufficiently small, Rscw,/(Siar — Sm) approaches 
1/4 for large | v,, — @| , as expected. The four points indicated in Fig. 5 
correspond to the cases of Table I. 

So far, the quantity zs[exp (q¢/kT)]/w.C, +/¢ has been assumed to be 
31.25; Fig. 6 shows how Rgc/w,C, Vo is affected if isfexp (qo/kT)]/ 
wC', J/é ¢ is changed from 31.25 to 1.08. The two curves of Fig. 6 have 
been calculated for I,/w,C,V¢é = 0.1. It is evident that Rsc/w.C, Vo 
does not depend critically on the value of 7s/w,C, ~/. The four points 
indicated in Fig. 6 correspond to the four cases of Table I. 


4.1 Region of Instability for Typical Diode Characteristics 


Typically, the breakdown voltage is sufficiently large so that one can 
assume ¢ — v, > 2 volts. For such values of ¢ — vm one can verify 
from Fig. 4 that Rsow,C, V6/Vd — vm > 0.16 if I,/w,C,Vb < 0.5. 
Thus, one can assume for typical diodes 


Rscw.C. Vo < 0.25. 





0.16 < (46) 


In the introduction, the range of pump frequencies associated with 
potentially unstable behavior was expressed in terms of the parameter 7. 
A comparison of inequalities (1) and (43) shows that this parameter is 


0.25 


ig= 31.25 ~~ 


0.20 


0.05 





Vin -® 
Fig. 6—Effect of 7s. 
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related to Rsq as follows: 


n= we Rs = Sar a V¢ 7 Um (47) 





Thus, according to inequality (46), 7 typically lies between 4 and 6.25, 
as was stated in the introduction. 


V. EFFECT OF Rs ON NOISE PERFORMANCE AT HIGH GAIN 


The limit imposed by Rs on the optimum noise performance obtain- 
able at high gain is now derived, when the diode is used as a down- 
converter and is terminated with equal impedances at w, + p. The 
effect of Rs on noise can be separated into two parts: one is the effect 
at w, + p, and the other at the output frequency p. We will see that if the 
diode is operated at high gain, the effect at w, + p is minimized when the 
diode is terminated with a very high impedance at w = p. Under this 
condition the effect of Rs at w = p vanishes and can therefore be 
ignored. 

Consider the effect of Rs at w, + p. If R, denotes the real part of the 
equal terminations at w, + p and w, — p, the real part of the total 
impedance terminating the barrier at wo + pis Ry, = Ra + Rs. 
Now inequality (43) implies that high gain is possible only if Ra, S Rsc, 
that is, only if 


R. = Rego — Rs . (48) 


Furthermore, when the resistance R,, equals Rgc¢ (i.e., when Ry = 
Rsc — Rg), one can show that high gain requires a very high output 
impedance in all cases of Section IV. (This is a direct consequence of the 
fact that, when the resistance seen by the barrier at w, equals Rese; 
instability may arise only if J, = constant; that is, only if the diode 
is biased by a de supply with infinite internal impedance.) 

Now the ratio of the thermal noise power available (at w, + p) at 
the barrier to that available at the terminals of the diode is Ry:/R. . 
This represents the impairment caused by the presence of Rs at w, + p 
on the noise performance of the diode as a down-converter. According 
to inequality (48), this impairment is minimized when #&, equals 
Rsc — Rs , in which case a very high termination is required at w = p. 
Thus, since the effect of Rs at w = p vanishes, and at w = w, + pis 
given by the ratio Rsc/(Rsc — Rs), the noise figure can be written 


Rse 


Fy, = 5 
Rsc — Rs 


F, (49) 
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where F is the noise figure obtainable at high gain in the ideal case 
Rs = 0, when the termination at w = p is a very high impedance. 
Using eq. (47) this relation can be rewritten 


-1 
F,, = (1 a n®*] F (50) 
We 
from which one obtains inequality (3). 


VI. EXPERIMENTAL RESULTS* 


Experimental data have been obtained at a pump frequency of 
1.25 GHz using a GaAs Schottky barrier diode,t having Rs & 4 ohms, 
C, = 1.2 pF, and Vz & 10 volts. This diode was mounted in a circuit 
designed to produce at the barrier very high terminating impedances 
at 2w, , 3w, , and 4w, . The structure is shown in Fig. 7 with the cover 
plate removed; the diode is inserted between two resonators both of 
which operate in the TEM mode. One resonator is connected to a 
50-ohm coaxial line and consists of a main line with two series-resonant 
circuits connected in shunt. The purpose of the two series-resonant 
circuits is to short out transmission at 2w, and 4w, between the 50-ohm 
coaxial line and the diode. Each circuit consists of a line element con- 
nected to the main line at one end, with a lumped capacitance between 
the other end of each line element and ground. 

The main line has the diode chuck soldered at one end; the other end 
is open-circuited. The electrical distance between the open-circuited 
end and the connection point at which the coaxial line is attached is a 
quarter of a wavelength at 3w,. The connection point of the coaxial 
line is therefore also short-circuited at 3w, . The distance from 
this connection point to the diode and the dimensions of the other 
resonator were chosen so as to open-circuit the barrier of the diode at 
2, , 3w,, and 4w, , using the following procedure. Initial dimensions 
for the two resonators were obtained experimentally by adjusting one 
resonator at the time (the other resonator and the diode being removed). 
The L-shaped resonator was adjusted so as to obtain two resonances 
at 2w, and 4w,. The other resonator was adjusted (with the 50-ohm 
line terminated in 50 ohms) for a resonance at 3w,. Then an empty 
package identical to that of the diode used in this experiment was 
mounted between the two resonators as shown in Fig. 7, and the reso- 
nant frequencies of this circuit were measured by coupling the circuit 

* The experiment described in this section was carried out by S. Michael of Bell 


Laboratories. 
t Supplied by J. C. Irvin of Bell Laboratories. 
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— DIODE 
~ 


50-OHM _— 
COAXIAL LINE 


Fig. 7—Circuit used to open-circuit the barrier of the diode at 2w,., 3w. and 4w». 


to a load and a generator by means of two loosely coupled capacitive 
probes. The resonant frequencies were found to be appreciably different 
from 2w, , 38w,, and 4w, as expected, because of the coupling between 
the two resonators resulting from the case capacitance of the package. 
The dimensions of the two resonators were then adjusted to give the 
desired resonances at 2w, , 3w,, and 4w, and the empty package was 
finally replaced with the actual diode. 

In order to separate the pump frequency w, from de, the 50-ohm 
coaxial line of the circuit of Fig. 7 was connected to one of the three 
arms of a monitor tee consisting of a main line shunted by an auxiliary 
line. The two lines are provided with a capacitor and an inductor con- 
nected in series to their central conductors to block signals in the vicinity 
of de and w, , respectively. The other two arms of the monitor tee are 
connected to an output matching network and a tuner, as shown sche- 
matically in Fig. 8. 

Figure 9 shows a block diagram of the apparatus used to measure the 
noise characteristics of the mixer of Fig. 8. The noise source is the AIL 
type 70 Hot-Cold Body Standard Noise Generator [consisting of two 
terminations, one immersed in liquid nitrogen (77.3°K) and the other 
mounted in a temperature-controlled oven (373.2°K)]. The pump is 
connected to a narrowband filter. This filter consists of four identical 
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Fig. 8—Mixer. 
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Fig. 9—Block diagram of apparatus used to test the mixer of Fig. 8; w. = 1.25 
GHz; p = 2 MHz. 
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cavity resonators tuned at w, and separated by transmission lines 
\/4 long at w, ; the attenuation at w, is approximately 10 dB and, at 
w, + pfor p = 2 MHz, is greater than 55 dB; its fractional bandwidth 
is approximately 0.07 percent. The noise source is buffered by an isolator 
providing more than 22 dB isolation at w, + p. The noise components 
originating from the source at w, + p (p = 2 MHz) enter the first 
circulator and are directed into the pump filter which reflects them back 
to the circulator where they are directed into a second circulator. The 
second circulator, which provides 22 dB of isolation at w, + p, is 
followed by the mixer, the circuit of which is shown in Figs. 8 and 9. The 
noise power entering the mixer at w, + pis converted to the output (IF) 
frequency p = 2 MHz. The converted power is then amplified and 
finally measured with a narrowband receiver. The IF amplifier system 
consists of a calibrated variable attenuator followed by an amplifier, 
the noise figure of which was optimized at 0.19 dB for p = 2 MHz. 
The purpose of the variable attenuator is to vary the noise figure F’; of 
the IF amplifier. 

The double-sideband noise figure of the mixer-amplifier combination 
is given by the familiar relation 


P= PF, + 74, (51) 


where F,, is the double-sideband noise figure of the mixer* and G,, is 
its gain. Note that G,, is not the conversion gain from w, + p to p, or 
from w, — p to p, but is the sum of the two. That is, if Gg and Gy¢ 
denote these two gains, G, = Gag + Gy, (thus G, & 2G.¢ because 
Gag = Gs). Measurement of F, consists essentially of determining 
two quantities: (7) the ratio Y = P,/P, , where P, and P, are the power 
outputs of the IF amplifier corresponding to the two temperatures 
T, = 373.2°K and T, = 77.3°K and (72) the insertion loss ¢ of the circuit 
connected between the source and the mixer; this loss is less than 1 dB 
and can be measured very accurately. The noise figure is given by the 
well-known formula F, = [1 + (7. — T,Y)/290(Y — 1)]é. The accuracy 
of measurement of F, is limited primarily by the accuracy of the two 
temperatures 7’, and T, ; the estimated error for F, is less than 0.1 dB. 

According to eq. (51), F,, and G,, can be determined indirectly by 
measuring the effects of F, of varying the IF noise figure F, . Figure 10 
shows a plot of F, versus F; which was obtained after adjusting the 

* F’, is the ratio of the total noise power output of the mixer to that portion of this 


power originating from the terminations of the mixer at w. + p and w. — p, assuming 
these terminations are at 290°K. 
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Fig. 10—Behavior of F, versus F;; mixer optimized for F; = 0.19 dB. 


mixer for minimum F, under the particular condition F; = 0.19 dB. 
According to eq. (51) the slope of this curve is the mixer conversion 
loss ZL, = 1/G,, and the point corresponding to F; = 1 is F,,. This 
curve tells us that for a gain of 0.948 the lowest noise figure obtainable 
from the mixer is 1.174. 

Figure 11 shows a plot of the minimum noise figure of the mixer 
versus its conversion loss. From this plot we can derive the lowest F, 
obtainable for a given F; as follows. From eq. (51), after replacing 
1/G,, with L,, and differentiating, we get 


aP, dP, 
dL, = dL,, ot (F; fe 1) " (52) 
0.75 
0.50 
in 
iE 
0.25 
0 
0 0.25 0.50 0.75 1.00 1.25 1.50 
bml=Gm7)) 


Fig. 11—Behavior of F,, versus Im = Gnt. 
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Because the curve of Fig. 11 has d’F,,/d’L,, > 0, it follows that a point 
on the curve minimizes I’, if it satisfies dF,/dL,, = 0 which, from eq. 
(52), results in 


F,=1-—. (53) 


Using this equation, we obtain from Fig. 11 the curve of Fig. 12, which 
shows the relation between F’; and the optimum value of L,, . One sees 
that it is desirable to have L,, < 1 (i.e., gain greater than unity) when 


F, > 1.37 (~1.4 GB) . (54) 


VII. CONCLUSIONS 


It has been shown that a Schottky barrier diode is capable of arbi- 
trarily high conversion gain as a mixer provided w, < w,/7, where 7 is 
a parameter typically less than 6.25, but always greater than 4. If 
w, < w,/n, then according to inequality (3), the ultimate double- 
sideband noise figure at high gain should be very close to unity. A 
fortiori, the ultimate noise figure at low gain should be excellent, if 
w, < w,/y. These conclusions are corroborated by the experimental 
results. Although the experimental result obtained is very good (Ff, , 0.7 
to 2.3 dB), it does not achieve the theoretical limit, nor was it expected 
to; practical limitations such as input circuit losses, estimated to 
exceed 0.4 dB for G,, > 1, confine the experimental noise figure for 
Gm >> 1 to values appreciably higher than the theoretical limit. 


0.75 


0 
0 0.25 0.50 0.75 1.00 1,25 1.50 


Lm 


Fig. 12—Relation between F; and the optimum value of Ln. 
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APPENDIX A 


In this appendix, we analyze the condition J, = constant for the 
case of |(qu,,/kT)| >> 1 of Section ITI. The average current through the 
barrier resistance is 


I, = ON aisle BO —_ 1, : (55) 


Thus, if a perturbation 6J is applied to the diode current at w, , 
while J, is held constant, from eq. (55) the resulting perturbation 
of v,(é) must satisfy the condition 


(eee Dae) 50, (t))ave = 0. (56) 
According to eq. (20) this condition can be rewritten in the form 
(e ?F"© S(t) 5S(t))ave = 0, (57) 
where 
ee a)? 
B= i (C.V6). (58) 


Note that S(t) is completely specified by its maximum and minimum 
value S,, and S,,. We will presently show that if for a given value of 
the ratio 


(59) 


al 
Sar 
we let Sy, > © then the time function exp [— BS*(d)] over the interval 
—T/2 < t S T/2 approaches an impulse located at ¢ = 0. Thus, if A 
denotes the area of this impulse, we will show that for a given value 
of r we can write: 


exp[—BS’} > Au) (| t | S T./2) (60) 
for Sis — ©, where u,(t) denotes the unit impulse. From this result 
and the fact that S(0) = S,, we find that for Sy, > © Eq. (56) becomes 

S(O) 6S(O) = S,, 68S,, = 0. (61) 


We can thus say that for S, — ©, condition (14) is equivalent to 
condition (34). We now show that relation (60) is true for Sy, > ©. 
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Let ¢ be an arbitrarily small positive quantity. We have to show that 
for a given value of r 


‘| exp [—BS*(t)] dt 








Jim. 62 
[ exp [—BS*(4)] dt 
0 
First, note that S(t) can be written 
S() = su rt = = cos ant : (63) 


Using this expression one can verify that exp [— BS’(t)] < exp [—BS’(6)] 
fore <¢ S T/2. Thus, 


/ exp [-BS%(B)] dt < 7 exp [BS Oy. (64) 


Furthermore, it can be shown that if S,B is sufficiently large then 


T —r?SM?B 
2 +/2er(1 — r)BS?, , 
In fact, since cos w,f > 1 — (w,t)’/2, from eq. (63) 


” exp [—BS*()) di > 2 (65) 


so < %[r+ 99 DG pal 


= [7+ 05% iy ee a at] 


This inequality allows one to write 


T/2 
i exp [—BS*(d)] dt 
0 
T eT Sar arVr(l-r) BS M?2/2 F : 
> On Vr — NBSg/2 : exp [—u — yu'] du, 


where y = (4BS%,;r°)~*. If BS, is sufficiently large, so that y < 1, 
then from this inequality we obtain (65). 
From inequalities (64) and (65) we obtain 


| exp [—BS*(t)] dt 





or Pa VJ BS, (1 — rr eo BIS arts a) (66) 
/ exp [—BS°(2)] dt 
0 
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According to Eq. (63) we can write 
S*(e) — Su = Su8,(e), (67) 


where &,(e) is a positive quantity which depends upon r and e¢, but is 
independent of S,,. From eq. (67) and inequality (66) we therefore 
conclude that eq. (62) is true. Note that, since ¢ is a small quantity, 
from eqs. (63) and (67) we can write 


2 
8,(e) & ae 
Thus, 
T/2 
/ exp [—BS*()] dt 
Se 7a K 1 
/ exp [—BS?(8)] dt 
0 
provided 


2 
2 oe 
BSx > (i ryre” 


APPENDIX B 


Figure 13 shows a network consisting of a variable elastance, a 
variable resistance, and a constant resistance R{. Assume that the 
current 7% through the variable resistance is related to the voltage vj as 
follows: 


ih = en (68) 
and that the variation of the elastance S’ is characterized by the relation 
Si = V0. (69) 





Fig. 13—-Network representing a Schottky barrier diode. 
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Let the terminal current 7{ of this network be a periodic function of + 
with period 27, so that 


1 = U(r) = U(r + Qn). (70) 

Then, 2{ and vj are related through the differential equation 
1 ah 
a/ —v dr 


Furthermore, one sees from Fig. 13 that the terminal voltage v, can be 
written 





+ vere = i — 0. (71) 


vg = URL + 5. (72) 


A Schottky barrier diode can always be represented by such a network. 
In fact, if we assume 


T = wol 

Vv, =v, — ¢, vi =v, —G$+71,R, 

se cen 

oe Cw. Vo 

Ge ve Cavs 
Ri = Rw.C,V¢$ (73) 


and substitute these relations in eqs. (71) and (72), we obtain eq. (39) 
and the relation v», = 7,R, + v,. Thus, the two networks of Figs. 11 
and 13 are equivalent. 


Analysis 
Let us now assume for 2} the form 
Cy for +750 
Ii + 2I’ cosr, for 7 >0 


IA 


% = U(r) = (74) 
where C is a constant. Since then 74 = 72 = C for7 S 0, from eq. (68): 


vy = (7) = - (nC — Inz) for 7 <0. (75) 


Since for 7 = 0, z{(r) is periodic, it is reasonable to expect that for r 
sufficiently large, v}(r) will also be periodic. Let us suppose there is a 
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positive integer N such that one can write to a good approximation 
vi(2arN + 2x) & vi(2rN), (76) 


so that vf(r) can be assumed to be periodic for r = 2zN, 


u(r + 2x) & vi(r), for +r 2 2rN. (77) 
Then we can write 
vi(r) = vf, + W(Relvfie’ + ---) for +r = 2N, (78) 


where the dots indicate the components of order 2, 3, ete., and 
1 2rN+2Q24r 
Vio = 3 v3(r) dr (79) 


Qn 2aN 


and 


i 2arN+24r 


I 


vi,==— ui(r)e"" dt. (80) 
20 Jorn 

Thus, if we can determine the behavior of v{(r) over the interval 2rN S 
rt < 2n(N + 1), where N is the smallest positive integer for which 
condition (76) can be assumed to be fulfilled, then the coefficients 
Vi, and V{, can be readily determined by these two relations. 

For given values of C, I? and I’, an approximate solution to eq. (71) 
can be obtained using the Euler method. This method requires that the 
continuous variable + be replaced by the discrete variable nr, (n = 0, 
+1, +2, etc.). Since in eq. (74) for r 2 0 f(r) has period 27, it is 
convenient, to assume for the step size 7, an exact submultiple of 27, 

2r 
ea} (81) 
where P is a positive integer. In the Euler method of solution, eq. (71) 
is replaced by the difference equation 


Vie. = v,, + D7. ) (82) 


where v/ is the approximation to vj for 7 = nr, , and D, is the approxi- 
mation to the derivative dvi/dr for r = nr, . From eq. (71) 


Dy = V 05 (ig — ie") (83) 
where 
/ f 
ven \" + 2I’ cos (nr,), for n> 0 . (84) 


C, for n= 0 
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Equations (82) to (84) show that the values of v, , », , v2 , ete. can be 
calculated sequentially, starting with 


D20:. os (i) (85) 


$ 


then calculating vj and D,, and so on. Let N be the smallest integer 
for which the condition 


VP (N+1) = Uby (86) 


is satisfied. Then, according to eqs. (79) and (80) the desired approxi- 
mations to V’ and V’ are . 


122 

y=5 Uk+PN 
oe (87) 
ES ° 

=s pa UK+PNE | K4PN + 


Table I* and Figs. 4 to 6 have been calculated choosing as initial 
condition 


w= C= 54+ 20. (88) 


The value of N depends on the value of Jj . One can show that N > ~ 
for I, — 0. Thus, the above method is not suitable if J} is too small. 
However, for the values of J§ that are of practical interest, NV is typically 
a small integer; for instance, NV S 4 in all the cases in Table I. 
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The Accuracy of Call-Congestion 
Measurements for Loss Systems 
with Renewal Input 


By A. KUCZURA and S. R. NEAL 
(Manuscript received June 23, 1972) 


The concept of a generalized renewal process is used to derive an asymp- 
totic approximation for the variance of the observed proportion of un- 
successful attempts on a trunk group during a given time-interval. Calls 
are assumed to arrive according to a general renewal process, and those 
which are blocked leave the system and do not return (loss system). 

As an application of our result we examine the special case of an overflow 
input-an amportant example from telephone networks with alternate 
routing. Comparison of our results with values obtained from simulation 
indicates that the approximation is quite accurate for telephone traffic- 
engineering purposes. 


I. INTRODUCTION 


In a communication network, the proportion of unsuccessful attempts 
on a trunk group during a specified interval of time is called the measured 
call-congestion, and is used to estimate the single-hour blocking prob- 
ability for many of the trunk groups in the Bell System network. In 
order to determine how many measurements should be taken to properly 
assess system performance, one needs to know the statistical accuracy 
of the estimated blocking probability. 

In the context of telephone traffic-engineering, the measured call- 
congestion is an unbiased estimate of the blocking probability, and 
hence we use its variance as an indicator of the precision of the measure- 
ments. For loss systems with exponentially-distributed service times, 
the variance has previously been studied under the assumption that 
calls originate according to a Poisson process.’ However, attempts on 
a trunk group are well approximated by a Poisson process only for those 
groups which do not serve overflow traffic from subtending groups, so 
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that earlier results do not cover intermediate high-usage and final 
groups. 

Assuming that the arrival process is of the renewal type, we derive 
an asymptotic approximation for the variance of the measured call- 
congestion for loss systems with exponentially-distributed service times. 
Since a single stream of overflow traffic is a renewal process, our results 
provide an estimate of the accuracy of call-congestion measurements 
made on intermediate high-usage and final trunk-groups.* 

In Section II we describe the mathematical model used to solve our 
problem. The asymptotic approximation is derived in Section III. 
We also consider the ‘number of calls carried” as an unbiased estimate 
of the carried load and derive an asymptotic estimate of its variance. 
Section IV contains numerical results and Section V consists of a 
summary and our conclusions. 


Il. MATHEMATICAL MODEL 


We consider a system of c servers serving customers, the arrival 
epochs of which constitute a renewal process. We assume that the 
interarrival times are independent and identically distributed according 
to the distribution function F, and that the service times are also 
independent and identically distributed according to an exponential 
distribution with unit mean. If all servers are occupied when a customer 
arrives, he leaves and has no further effect, on the system. If an idle 
server is available when a customer arrives, service begins immediately. 

Let (0, ¢] denote a time interval of length ¢ which commences at a 
point chosen at random on the time axis. Let N(t) be the number of 
arrivals and O(t) the corresponding number of blocked attempts in 
(0, t|. The ratio O(t)/N(@ is the measured call-congestion. In Section III 
we show that the variance of O(t)/N() can be approximated in terms 
of the covariance between O(t) and N(t) and of the individual first 
two moments of O(f) and N(t). We now describe the mathematical 
model used to obtain the required moments. 


2.1 A Multi-Dimensional Renewal Process 


Let t,, » = 0, 1, 2, --- , be the instant of time at which the nth 
overflow occurs, lb < 0 < t; < --- , and set X, = #, — t,-,. The 
interoverflow times X, ,n = 1, 2, --- , form a sequence of independent 


(because holding times are exponential) and identically distributed 


* For engineering purposes, the total overflow traffic offered to such groups is 
adequately described by a single overflow process.? 
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random variables. Let K,,n = 1, 2, --- , be the number of arrivals 
occurring in (é,-; , t,] and define the row vector 


x, = (1, K,), m=1,2,°--. 


Since K,,n” = 1, 2, --+ , is also a sequence of independent and identi- 
cally distributed random variables, the components of the vector 
sequence x, ,n = 1, 2, --- , are independent and identically distributed. 
Now, set 

n(t) = > Xn » 


where the sum is taken over all n such that 0 < t, S t. With these 
definitions it follows that for large t, 


nt) = (OW, N@) 


and that (x, , X,),” = 1, 2, --- isa multi-dimensional renewal process.° 
Consequently, the results communicated by J. M. Hammersley’ in 
the discussion of W. L. Smith’s paper apply directly to our model. 
In particular, we shall use his eqs. (25) and (26) to compute the moments 
of n(Z). 

We could also have used Smith’s results on cumulative processes to 
obtain, in an indirect way, the covariance between O(¢) and N(t). 
However, the concept of a cumulative process is not as naturally suited 
to our problem as is Hammersley’s generalization of a renewal process. 

Let u,(c) = E[X7] be the nth moment of the interoverflow times from 
a group of c servers and 


= | fare. 
For brevity we shall denote the arrival intensity vy’ by \. Equation (25) 


of Ref. 3 states that 


t 
E[O@] = ae) (1) 
and 
ee 
(Cc) 


But since E[N(t)] is also equal to XZ, we have (as is clear intuitively) 


E{K,|] = Xu (c). (2) 


EIN()] = E[K,]. 
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Now, using eq. (26) of Ref. 3 we have for large t 
t 
Cov [O(t), N(t)] ~ ZO {Muo(c) — E[K,X,]}. (3) 
. 1 


In this expression, as in others below, we omit terms which behave as 
o(t) for large t. The results of Section IV indicate their contribution to 
be negligible. 

Since the overflow epochs constitute a renewal process, and since 
we have a renewal input, we can also use eq. (26) of Ref. 3 to obtain 


Var (0()] ~ acy [uale) — HO] (4) 
and, with a change in definitions, 
Var (V()] ~ 3 bs = vt] (5) 


for large values of t. Since », and v, can be computed directly from F, 
we only need p,(c), w2(c) and E[K,X,] in order to evaluate the asymptotic 
expressions (3), (4) and (5). 


2.2 The Joint Distribution of K, and X, 
Let 


g-(k, t) di-= dP{K, = k, xX, ss t}, 


where the differential on the right-hand side is to be taken with respect 
to the variable t. Using the same arguments as those presented in Ref. 4, 
pages 388-389, we obtain 


gk, t) a e 'ge-i(k, t) 
a om [ (1 —e™“)g.(k — m, t — u)ge-1(m, u) du. (6) 


If we define 


yw, 8) = [Yo w'glle, Ne at, ) 
0 k=1 
then it follows from (6) that 


Y--1(w, s 1) ; 
1 al Ye-1(W, s) a Ye-1(W, 8 ae 1) 


This relation is identical to relation (7) of Ref. 4 derived for the case of 


(8) 


ve(W, 8) = 
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Poisson input, and (contrary to the author’s comment) is also valid for 
any arrival process of the renewal type. 

Relation (8) is of the same form as the recurrence relation for the 
Laplace-Stieltjes transform of the interoverflow distribution for loss 
systems with renewal input (see Ref. 5, page 37). Consequently, we can 
follow the outline of the analysis in Ref. 5 to obtain y, . 

First, Riordan’s results imply that y,(w, s) can be written in the 
following form: 


yew, 8) = rere (9) 


where D,(w, s) = 1, and, as can be seen by setting c = 0 in (7), 


Dw, s) — (10) 


was) ’ 


where 


se I ” et dF (e). 


IV 


Furthermore, for r = 1, 





1 
D,.:(w, 8) = D,(w,s) + E- — | D,(w,s + 1). (11) 
Following Riordan, we define 
1 
A; = A,j(w,s) = 1 - mee (12) 


Now using (10) and (11) and mathematical induction (as noted by 
Riordan) one can show that 


D,(w,s) = 1+ > -'(")ros OP NG pr (13) 


Finally, since 
min 


m Tt n 0 
E[KTX?] = (-1) Sw ag Ves s) 





w=l 
s=0 


form = 0, land n 2 0, we can compute the required moments directly 
from (8). 

First, differentiation of (8) with respect to s yields the following 
results: 


My (c) = »D, 
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and 
pale) =“ m(@) + 2m) Xml) — 2, DE”, (14) 
where 
D, = D.i,1) =1+ S (—1y'(%) aod, Sp ess (15) 
is the reciprocal of the generalized Erlang B blocking, 


and 





te] 
D&E? = — Du, s 
0s ( d ) er 
s=1 


Performing the last operation, we have 
wr) _ ¥ _41yif © ae 2, Mt sig Gi. 
D. 7 2 ( y (0) sod, A, 2 . Ay " = Aj-y a) 


where ; is the derivative of \,(1, s) evaluated at s = 1, 1e., 


» _ wk + 1). 
* &(k + 1) 


Similarly, the joint expectation of K, and X, is given by 


Q 


E(K,X,] = b(c) ae = by (c) 2s p(k) +», D?” a Dos, (17) 


where 





fe] 
Do” = — D.u,s 
aw ( ) ) ra 
s=1 


is given by 
a i aoe es 
D, = 2 ( 1) (°)aca, Av] 2 + Ay + + Ay J 


and D&” is given by (16). 
Before concluding this section, we show that the covariance function 
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(3) reduces to the expression given by Descloux’ for the case of Poisson 
input. Substituting (14) and (17) into (3), we have 





ve—v De? +», DY” ; 
Cov [O(t), N(d)] ~ ae | fo ae | (18) 


Since y;'(c) = \B(c, d), where B(c, \) = D;" is the generalized Erlang B 
blocking, we can write 


Cov (0(), N(O) ~ MBE, NAS" — Be, [DY + ad9)} 


When the input is Poisson, we have the following simplifications: 


2 





Vo Be V; = 1, 
Vy} 
k+1 
om, ee 
1 
Pood ee 
Q; r ? 
Do” ste De”? = -Si i : i 


ng ets 
ar rN On EY.) 
where E,,.(A) is the Erlang B blocking probability 


© 
E,,.(\) = 





Ni 


c! : 
7=0 j! 


With these simplifications the covariance function becomes 
Cov [O(), N(D)] = Ms aN > (NE, (A)]. 
Substituting for the derivative (see Ref. 6, page 1) 
By.) = 2 fe — +B, OEL..0), 


we obtain Descloux’s result 


Cov [O(), N@] ~ AMEi, (EL + ¢ — X + MEL, (A). 
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Ill. CALL CONGESTION AND CARRIED LOAD VARIANCES 


In Section II we derived asymptotic approximations for the variances 
of N(é) and O(£) and the covariance between the two. These expressions 
can now be combined to obtain an asymptotic approximation for the 
variance of O(t)/N(t). Moreover, without additional effort we can also 
approximate the variance of 


N(t) = OO 


L(t) = (19) 


which is the number of calls carried per mean holding time, i.e., an 
estimate of carried load. 
3.1 Variance of Call Congestion 


From the theory of standard errors the variance of the call congestion 
is given approximately by’ 





te [28 | 2 ae (at (O(] , Var {N(] _ 2 Cov (oC), Neo} F(0() 


Ni) EO) © BINO] EO@JEIN(®)] J ENO] 
(20) 


The derivation of this expression is based on squaring the first-order 
terms of a Taylor series expansion of O(t)/N(é) about the means of 
O(t) and N(t); the higher central moments of O(é) and N(é) are omitted. 
The accuracy of the approximation is discussed with the numerical 
results in Section IV. Using eqs. (1), (2), (4), (5), and (18) to substitute 
for the various quantities on the right-hand side of (20), we obtain 


O00) 2 1 (wid — Hi) 2% 
me Ee nO i 


sSyor-esami 0 


3.2 Variance of Carried Load 


The expectation of (19) is 


F(g()] = ve = 5: 
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Since 1/[Ay,(c)] is the blocking probability, we have 
E[£@] = All — Be, »)] 


which shows that (19) is an unbiased estimate of the carried load. The 
variance of this estimate is given by 


Var [£(t)] = z {Var [N(t)] + Var [O()] — 2 Cov [O(t), N(d]}. (22) 


Substituting for the various terms on the right-hand side of (22), we 
obtain for large ¢ 


Yee ~ fe alo 4 9 [Zo 


2 
ic) 





te [pe + vi, Ds}. (23) 


IV. NUMERICAL RESULTS 


We are primarily interested in the accuracy of the measured call- 
congestion when the input is overflow traffic. Of course, one expects 
that the variance of the measured call-congestion will increase as the 
variance-to-mean ratio (peakedness) z of the offered traffic increases. 
We must also test the accuracy of the various analytical approximations. 
In particular, we must determine whether the standard measurement 
period of one hour (about 20 mean holding-times) is long enough for 
the asymptotic expressions (3) through (5) and the approximation (20) 
to be accurate enough for engineering purposes. 

We computed the estimate (21) of call-congestion variance for trunk 
groups of 6, 10, 20, 30 and 40 trunks serving overflow traffic with various 
values of peakedness ranging from one to ten and a measurement 
interval of £ = 20 mean holding-times. In each case, the offered load 
was varied over a range from 0.05 erlangs/trunk to 2 erlangs/trunk. 
The interarrival-time distribution of the originating traffic was obtained 
by using the Interrupted Poisson process’ with a three-moment match. 

To check the accuracy of the various approximations, estimates of 
the variance were obtained by simulation for each of the cases mentioned 
above. The results for the five different trunk groups were of the same 
general form as for the ten-trunk case shown in Figs. 1 and 2. 

Figure 1 is a graph of the standard deviation of the measured call- 
congestion ¢, = {Var [O(t)/N(t)]}? vs the offered load a = \ = »;' for 
several values of z and c = 10 trunks. Notice that for a fixed value 
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Fig. 1—Standard deviation of measured call-congestion vs offered load force = 10 
trunks and overflow input having peakedness z. 


of a, og increases as z increases (as was expected). The curves are 
terminated in the region of z = a. In general, we found that our analytic 
results were in good agreement with the simulation for a > z. This 
verifies the accuracy of our approximations when a > z. However, a 
notable disparity occurred in several cases when a < z. The latter 
inequality rarely arises in telephone traffic; but for other applications, 
where a < z might occur (e.g., data transmission), further work is 
required. 

Figure 2 illustrates the behavior of the coefficient of variation o,/B 
of the measured call-congestion. For a large range of offered loads, the 
coefficient of variation decreases as z increases, indicating that the 
blocking probability increases faster as a function of z than does the 
standard deviation. Although the variance of the measured call-conges- 
tion decreases as a decreases for small a, the coefficient of variation 
(relative error) increases hyperbolically. Consequently, the relative 
accuracy of the measured call-congestion decreases as the blocking 
probability decreases, i.e., as the number O(t) of observed overflows 
decreases. 

Figure 3 displays the coefficient of variation as a function of aB = 
E{O()/t for several trunk-group sizes. In each case we used both z = 1 
and z = 10. The slope of the curves on the log paper is approximately 
—1/2. Hence, the coefficient of variation of the measured call-congestion 
is approximately inversely proportional to the square root of the number 
of blocked calls observed. 
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V. SUMMARY 


For loss systems with renewal input and exponential holding times, 
we derived asymptotic approximations for the respective means and 
variances of N(t), the number of arrivals, and O(¢), the number of 
overflows in the measurement period (0, ¢]. We also obtained an asymp- 
totic estimate for the covariance between O(t) and N(é). Using these 
results, we obtained an estimate of the variance of the measured call- 
congestion O(t)/N(t), as well as the variance of £(f) = [N(t) — O()]/t 
which is an unbiased estimate of carried load, provided the mean holding 
time is known. 

Our analytical approximation for the variance of O(t)/N(t) was 
checked by simulation for systems serving overflow traffic. In those 
cases which were tested, the simulation results were in excellent agree- 
ment with the analytical results for the range of system parameters 
(roughly z < «@) which normally arise in telephone-engineering applica- 


ANALYTICAL RESULTS 
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Fig. 2—Coefficient of variation of measured call-congestion vs offered load for 
c = 10 trunks and overflow input having peakedness z. 
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Fig. 3—Coefficient of variation of measured eall-congestion vs overflow rate for 
c = 6, 20, 30, and 40 trunks, and input traffic with peakedness z = 1, 10. 


tions. We also found empirically that the coefficient of variation of 
O(t)/N(@) is approximately inversely proportional to the square root of 
the mean number of overflows in (0, f]. 
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A Cost Optimization Model for Seismic 
Design of Structures 


By S. C. LIU and F. NEGHABAT 
(Manuscript received July 17, 1972) 


Considering the earthquake susceptibility of structures located in seismic 
regions, the question arises as to what level of protective measures should 
be provided in order to achieve a certain degree of reliability against possible 
damage. To address this question, engineering risk and optimal design 
of structures located in a seismic area are studied. The basic concept is 
to obtain a tradeoff between the cost of providing a protective measure 
and the expected cost of earthquake damage. 

A simple mathematical approach is presented to determine the optimal 
earthquake intensity which the structure 1s designed to withstand. The 
objective 1s to minimize the total construction cost of the structure plus 
the expected cost of earthquake damage throughout the entire service lrfe 
of the structure. For the case of deterministic structural resistance, and 
for structural response processes having Poisson (independent) crossings, 
an objective function is derived in terms of the building and earthquake 
variables. The optimal design intensity can then be determined by mint- 
mizing the objective function with respect to the intensity variable. The 
resulting equations are relatively simple and can be easily handled for 
numerical studies and sensitivity analysts. Generalizations of the results 
for nondeterministic structural resistance and for structural response 
processes different from those having Poisson crossings are also indicated. 

As an illustration of the proposed approach, a hypothetical building 
with realistic seismicity and structural parameters is analyzed for its 
optimal design earthquake intensity. The construction, damage and total 
costs are obtained in terms of the intensity variable. The tmplications and 
sensitivity of the results are also discussed. 


I. INTRODUCTION 


Structures constructed in seismic regions are required to function 
properly in a forcing environment characterized by random earthquake 
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occurrences and intensities. The seismic environment including the 
expected earthquake-magnitude levels and the corresponding frequency 
of occurrence for different seismic-risk zones was described previously.’ 
The study was based on a statistical analysis of nationwide seismic 
data and may be used as a guide for the development of seismic design 
requirements on a global basis. Under localized situations, however, 
the seismic requirement for structures that are expected to adequately 
withstand the earthquake environments should be based on cost- 
reliability studies. During an earthquake of given intensity, there exists 
a probability that the response of the structure is greater than its 
resistance capability and, therefore, a probability of damage to the 
structure. The cost associated with this probable damage may be 
referred to as the ‘earthquake risk cost’’. Increasing the design intensity 
of the structure reduces the probability of damage, but at an increased 
cost of construction. Therefore, an optimal design earthquake intensity 
can be determined by achieving an appropriate balance between the 
construction cost and the earthquake risk cost. 

This paper presents a new analytical approach to the determination 
of the economically optimal earthquake intensity or other design 
variables for structures. The construction and earthquake risk costs 
are expressed in terms of design intensity and other parameters re- 
flecting the earthquake and structural characteristics. Minimization 
of the total expected cost of the structure yields the optimum structural 
design intensity in terms of such parameters as estimated cost of 
earthquake damage, unit construction cost, expected earthquake 
duration, and statistics obtained from seismological data for the par- 
ticular site. 


II. ANALYSIS OF DESIGN INTENSITY MODEL 


2.1 Objective Function 


Consider a certain seismic region in which a structure is located. 
Let K,(2,) and K,(z,) represent the construction and the earthquake 
risk or damage costs of the structure respectively, both being functions 
of the design intensity 7, measured from I to XII on the Modified 
Mercalli scale. The function K.(z,) may be regarded as a monotonically 
increasing function of 7, , while the function K.(z,), as would be expected, 
is a monotonically decreasing function of 7, . 

The optimum design intensity 7*, may be obtained as a trade-off 
between these two functions by minimizing the total cost 


K(t,) = K.@) + Ka(t,) (1) 
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Fig. 1—Sketch showing cost of structure against design intensity of earthquake. 


as shown in Fig. 1. The function K(z,) in eq. (1) is the objective function 
and 2, is the decision variable. 
The construction cost, K,(z,), can be written as 


K.(t,) = A,(f + C) (2) 


where A, is the floor area of building, f is the building cost per unit 
floor area, and C is the cost of earthquake protection per unit floor 
area. The earthquake protection cost obviously increases with the 
protection level, such as 7, in this case. Therefore, C = C(z,) is a mono- 
tonically increasing function of 7, . It should be noted that the deter- 
mination of the function C(z,) depends on the type of structure, method 
of design, and is greatly influenced by the designer’s personal judgment 
and experience. A reasonable first approximation can be made assuming 
C(z,) is linear with a coefficient c. Under this assumption eq. (2) can be 
written as 


K.(t,) = A,(f + et.) (3) 


where A,f equals a fixed initial cost of building, and c is the earthquake 
resistance cost of the building per unit floor areca per unit intensity. 

To determine the function K,(z,), analytical procedures can be 
effectively employed utilizing some basic knowledge about the structure 
and the random forcing environment. Let N(t) be a random variable 
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representing the total number of earthquakes to occur in time ¢. Further- 
more, fork = 1,2,--- , N(é), let p, be the probability that the structure 
fails given, the kth earthquake occurs, and Iet e, , an identically dis- 
tributed random variable, be the associated loss. Therefore, the total 
cost of earthquake damage, Z, is 


N(t) 


Z = CxPr : (4) 


k= 


va 


_ 


Taking expectation (denoted by overbar) of both sides of eq. (4), one 
obtains 


Z = N(bep (5) 


in which N(t) equals the expected number of earthquakes (of. all in- 
tensities) in time t, é equals the expected value of the random earth- 
quake loss, and # equals the mean failure probability of structure 
given an earthquake occurs. The present worth of the expected value 
of Z is the earthquake risk cost K.(z,): 


Kili.) = Zot) = N(epg@) (6) 


where g(t) is a discount factor. 

The quantities on the right-hand side of eq. (6) will be discussed next 
in terms of the related design parameters which fall into two categories: 
the earthquake parameters and the building parameters. The earthquake 
parameters include the regional seismicity constants, the earthquake 
magnitude (m), intensity (7), duration (/,), amplitude (a) of the ground 
motions, and the statistics of these quantities. The building parameters 
include the mass (p), stiffness (k), natural frequency (w,), damping (é,), 
height (A), the resistance or strength (x) of the structure, etc. 


2.2 The Average Number of Earthquakes, N (t) 


The quantity \’(é) depends on and can be estimated from the regional 
seismicity. Earthquakes can be considered to be a series of events 
randomly distributed on a real line (representing time), and the sequence 
of original times {t,} forms a point process.” It is further assumed that 
the joint statistics of the respective number of shocks in any set of 
intervals are invariant under a translation of these intervals; this 
implies that {é,} is a stationary point process. The stationary point 
process generalizes certain aspects of renewal processes; in particular, 
the interval lengths 7, = ¢, — t,; between successive events need not 
be independently or identically distributed. 
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The simplest stationary point process is the Poisson process. In- 
tuitively, the process {t,} can be approximated as a Poisson process 
if it represents rare events. More rigorously, it requires that 7, be 
independently and identically distributed and follow a negative expo- 
nential function. The main deficiency of the simple Poisson model is 
its inability to describe the aftershocks which are often triggered by 
a large main shock. However, for most practical engineering purposes, 
the simple Poisson model for earthquakes appears to be adequate. 
In practice, an engineer is concerned with the earthquake risk of struc- 
tures located in some specific geographic areas. The risk depends heavily 
on the statistics of large earthquakes in these areas, and the omission 
of small earthquakes or aftershock processes is relatively unimportant 
in terms of earthquake risk. 

If {N(t); t = O} is assumed to be Poisson with a constant rate, a, 
then 


KObING Sa oN ee (7) 


and 
N(@ = at. (8) 


The parameter a per unit time ¢ can be determined from regional 
seismicity data.’ 


2.3 The Mean Failure Probability of Structure, p 


The quantity p depends on both earthquake and building parameters. 
The earthquake intensity ‘7’ to which the structure is designed will 
be the only decision variable considered in this formulation and all 
other parameters are assumed known. Let Y(t) = {max | y(é)|; ¢ © 
[0, ¢.]}, where, ¢, equals the duration of the structural vibration which 
is assumed approximately equal to the duration of the earthquake 
ground motion, and y(t) equals the response parameter (displacement, 
velocity, acceleration, stress, etc.) of the structure. For an earthquake 
with intensity 7, failure could occur when the resulting structural 
response y(t) equals or exceeds the actual resistance, x, of the structure. 
The corresponding failure probability, p(z), can be expressed by 


p(t) = prob [Y = x| earthquake with intensity 7 has occurred]. (9) 


The quantity p(z) is a function of the random variable 7 representing 
the earthquake intensity whose probability density function f;(z) can 
be found in terms of the regional seismicity and earthquake source 
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geometry. Since 2, in the Modified Mercalli intensity scale, takes only 
discrete integer values from one to twelve, the mean failure probability of 
structure, # is given by 


12 

B= DO phi), (10) 
in which 2, equals the design earthquake intensity. From eqs. (9) and 
(10) it is clear that earthquake parameters enter the formulation of the 
problem through p(z) and f,(7), while building parameters enter the 

formulation through p(z) only. 
The density function f;(z) can be derived from an expression obtained 
by Cornell’ for the distribution function of earthquake intensity 7: 


hay Sie ; To ea (-£;). (11) 


The governing assumptions for eq. (11) are: 


(z) The earthquake magnitude m is a random variable with in- 
dependent and negative-exponential distribution function 


Fy(m) =1—- e°" m=m, (12) 


where m, is some magnitude small enough, say 4, that events of lesser 
magnitude may be ignored by engineers, and 8 is a constant the inverse 
and inverse square of which represent the mean and variance of earth- 
quake magnitude m > 0 respectively; 

(zz) The intensity attenuation law is given by 


1 = ¢( + CoM — Cz In r (13) 


in which c, , ¢2 and cs are regional seismicity constants,' and 7, the 
focal distance in miles, is the random variable representing the distance 
from the structural site to the location of an earthquake source on the 
fault line. 

(ia) The earthquake has a line source (fault line) of length 7? Gn 
miles) with uniform distribution. 
The parameters I and J are given by: 


exp | 6(% + m.) | (14) 


C3 


ie: dr 
al ape age: te oe 


r 


t For example, c:, 7 = 1, 2, 3 are semi-empirical constants on the order of 8, 1.5, 
and 2.5, respectively, for firm ground in Southern California.‘ 
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in which d = minrand?r, = max r. 
It follows from eq. (11) that 


2.4 Determination of the Failure Probability of Structure, p(1) 


To find the structure’s failure probability p(z), as defined by eq. (9), 
it is necessary to specify the failure mechanisms of the structure. It. 
is also necessary to establish stochastic models for the response pa- 
rameters y(t) and the corresponding resistance x of the structure. For a 
linear and deterministic structure which is assumed to experience no 
plastic deformations and the properties of which are governed by given 
constants, the response model can be obtained given a stochastic model 
for the input earthquake ground motion. For the case of random 
strength, x, it becomes necessary to determine the distribution function 
Fx(x) based on statistical and laboratory tests on individual building 
components as related to the overall structural resistance, e.g., 
Kennedy.” Similar tests were used by Freudenthal and Wang to estab- 
lish a representative distribution of the ultimate strength of aircraft 
structures.” 

In this study, consideration will be limited to the first excursion 
failure only, and the input and response of the structure are both 
treated as random processes. The probability of the duration of the 
response amplitude excursion and other failure mechanisms such as 
wearing and fatigue are not considered. In the first excursion failure, 
a structure is said to have failed if the response parameter, y(t), exceeds 
a prescribed resistance or strength level, x, during the vibration cycles 
caused by the earthquake. Let the duration of the structural vibration 
be approximated by #, , then, for any ¢ ¢ [0, ¢,] 


pt) = prob[ | y | 22,05 tS t,] 
=l1- W t,) 


in which W,(t,) = prob [| y(é)| < 2;0 S$ ¢ S ¢,] = the reliability of 
the system. Two different situations in the structure’s resistance char- 
acteristics will be considered below. 


(17) 


2.4.1 Deterministic Resistance Variable, x 


A random process model for the response of structures subjected to 
a stationary earthquake excitation can be established as follows. A 
simple structure can generally be treated as a lightly damped linear 
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oscillator and its response, y(t), is related by a second-order differential 
equation to the excitation, e.g., a;(#), the ground acceleration of an 
earthquake with intensity 7. A multistory structure can be treated 
similarly in generalized coordinates considering normal mode vibrations. 
The structural response y(é) in our case is a Gaussian process which 
approaches stationarity after a few cycles of initial transient motions. 
Let p,(t)dt denote the probability that y(t) exceeds the threshold y = x 
during the interval [t, ¢ + dé] for the first time since the initial time 
t = 0. The probability density function p,(¢), referred to as the first- 
crossing density, is related to the reliability function by —(dW/dt) = 
pi(t). While establishing the precise behavior of p,(¢) for small t poses 
some difficulty, for most practical purposes some approximations can 
be made for large mean failure time ¢. The simplest approximation to 
the first-crossing density is to assume that the up-crossings of the 
threshold occur rarely in the stationary response, so that they can be 
considered as statistically independent events. If so, the instants at 
which | y(é) | cross the level x from below would constitute a Poisson 
process with a constant rate 2y, , where v, is the level crossing rate of 
y(t) at the level y = x. In this situation it can be easily shown’ that 


pti) = 1 — exp (—2r,t,) 


= 1 — exp [—2»,t, exp (—2”/207)] (18) 
in which vy, = o,/(ro,) = zero crossing rate of y(t); v» = rly. = 
vy, exp (—2"/207); where ¢, = standard deviation of y(t), ¢, = the 


standard deviation of y(t) = dy(t)/dt. All these quantities are dependent 
on earthquake parameters (therefore, on intensity 7) and building 
parameters. These dependences will be derived later in this study. 
More specifically, it will be shown that o, and oc, are directly proportional 
to 7 and that v, is a constant. 

Expressions for p(z) under other assumptions on the response process 
are presented in Appendix A. 


2.4.2 Random Resistance Variable, x 
The resistance variable x is a random variable with probability 
density function fx(x). In this situation, the failure probability is 
given by 
pit) = / prob [max | y(t) | 2 27;0 St S #,|fx(x) dv. (19) 
0 t 


Let Y(t.) = (max, | y()|, 0 S ¢ S t,); and N,(t) = the number of 
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peaks of | y(¢) | in the time ¢; then {N,(é); 0 S ¢ S t,} is a random 
process. Assuming it is a stationary Poisson process of intensity X, , 
it follows from eq. (18) that \, = 2v,. For situations as described in 
Appendix A, \, = 2v,1n[1 — exp (—2’/20?)] from assumption (2); 
and i, = 20, 2,2/c, from assumption (iz). 

The first excursion probability can be expressed in terms of \, as: 





prob [Y(t,) S xz] = exp [—),t,(1 — F.(x))] (20) 


in which Fy(x) = f*,, fx(x’) dz’. From eqs. (19) and (20) the expression 
for p(t) becomes 


pi) = [> (1 exp [=a = Fra) xa) de.) 


A closed form solution of eq. (21) is possible when f,x(t) has a simple 
expression such as a uniform, Gaussian, or Rayleigh density function. 
In general, eq. (21) can be conveniently solved by numerical integration. 


2.5 Ground Motion Statistics 


The statistics characterizing the random ground motion shall now 
be brought into the formulation. Let a;(t) be the ground accelerations 
of earthquakes of intensity 7 and assume {a;(t);0 S ¢ S #,} be a sta- 
tionary process with a power spectral density function G,;(w), where 
w is the frequency variable. Such a stochastic ground motion model 
has been proposed and used extensively, e.g., by Liu® and by Jennings 
et al.° Further, assume that the process {a,(t)} is a filtered white noise 
with a constant power spectrum density G per unit intensity, and that 
the ground filter is a linear, single-mode oscillator with constant fre- 
quency and damping characteristic values w, and £,, respectively. The 
following is a derivation of eq. (22) showing the direct relationship 
between G,,;(w) and intensity 2. 


TG 

Gi (w) = (w? a7 wey ra AP wwe (22) 

The earthquake intensity value is a measure of the damage potential 
which is represented by the corresponding response spectrum S, = 
max, | y(t) | for a structure with natural frequency and damping pa- 
rameters w, and &, . Therefore, S, = S,(z, w, , & , t.) is clearly an in- 
creasing function of 7. The precise functional relationship between 
S, and 7 is not yet known, but can be obtained from data fittings of 
calculated response spectral values of past earthquakes with known 17. 
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A simple linear approximation may be made for our analysis by assuming 
S,(, Wo, &,t,) = ke (23) 


where k is a constant of proportionality. From the ground acceleration 
model defined above, it can be shown’? that 


S, = o Ka, (24) 
where 
K = (21n»,t,)? + 0.577(2 In »,t,)7?. (25) 


Since K is independent of 2, as will be shown later in this section, 
it is obvious that 


G,:(w) = 7G,(w) (26) 


satisfies eqs. (23) and (24); and according to the ground motion model 
as defined earlier, G,(w) is given’ by 


G 
Finally, eq. (22) follows directly from eqs. (26) and (27). 

It may be noted from eqs. (23), (24), and the relation o? = f%,, G.,() 
| H(w) |? dw, that the power spectrum density G,;(w) of the earthquake 
process is proportional to 2’, which agrees with eq. (26). Housner and 
Jennings'’ have used the relationship G,;(w) = const. S? , which also 
leads to our assertion of eq. (26). 

A difficulty exists in determining the value of the constant spectral 
density G corresponding to a unit Modified Mercalli intensity level. 
Because the intensity cannot be precisely related to the earthquake 
waveform parameters such as the amplitude of acceleration, velocity, 
displacement, response spectrum intensity, etc., some normalization 
procedures based on these parameters must be used to determine G. 
For example, a constant power spectral density level for the input 
white noise to the ground filter is determined by matching the cor- 
responding expected velocity spectra of the filter’s response to Housner’s 
average velocity spectra.” 

Using the well-known relation o” = [%,, G(w) dw, (i.e., the variance 
of a random process is equal to the integral of its power spectral density 
over the entire real line representing frequency), it follows from eq. (24) 
that the variances of a,(t) and d;(t) are respectively 02 = 1°Gm/(2&,w?) 
and o; = wo, . Also, from the relation G,(w} = | H(w) |’ G.;(#) in 
which H(w) = (w? — w? — 2jt,w,w)”* = the transfer function of the 


Gow) = (27) 
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simple structure for displacement output y(¢) and input a;(t), where 
j represents the complex unit, it can be shown that’ 


aGB + . 
«= (Gage aa) is - 
2 1GAs Yi ae 
0 = (qt) i = (29) 


in which Ay = 5s ) A, = 20,04 (EW, ar £ Wo), A, = ws a ws + 4E,F,0.W, 5 
Az; = 2(E,w, + £,,) and B = AoA; = A, . 
From eqs. (28) and (29), the zero crossing rate of y(t) is 


vy, = 0,/4o, = 1 (Acds) 
: OE? ae Ne 


(30) 
To show that K is independent of 7 in eq. (25), it is sufficient to show 
that v, is likewise independent of z. This is obvious from eq. (30). 

This completes the discussion on the determination of the failure 
probability of structure p(z). It is shown that eqs. (18), (21), and eqs. 
(37) through (39) in Appendix A define p(2) for various failure mechan- 
isms. Furthermore, substituting eqs. (27) through (30) in the appropriate 
terms in p(z), indicates that p(z) is a function of intensity 7. Finally, sub- 
stituting eq. (16) and various expressions for p(z) into eq. (10) deter- 
mines j, which is a function of 7, . 


2.6 The Expected Random Earthquake Loss, é 


The earthquake loss depends on earthquake and building parameters, 
and the extent to which human lives are in danger. The quantity @ 
can be determined from statistical data of actual earthquake damage. 
Unfortunately, empirical values of é for different classes of constructions 
are not presently available. It is logical to assume that @ is directly 
proportional to the damage potential of earthquakes, therefore, either 
of the following relationships, or their combinations, may be appropriate: 


é = C, max a,(t) (31a) 
& = Cot, (31b) 
é= C,{S, , or S, , or S;}, (31c) 


in which C, , C, , C3 are the constants of proportionality and S repre- 
sents the expected response spectrum associated with the subscript 
response parameter. Equation (3lc) in which é is expressed in terms of 
the expected velocity spectrum S, appears superior to others because 
the effects of the amplitude, duration as well as the frequency char- 
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acteristics of the earthquake accelerogram, are all considered. Thus it 
will be used in the following analysis. Let C; = ¢A,f, where e is the per- 
cent loss in building cost per unit response velocity spectrum and ef is the 
expected earthquake loss per unit floor area per unit response spectrum. 
It should be noted that no upper bounds for e can be established in situa- 
tions where human lives are involved. In these situations, the value of 
would increase for large-occupancy structures such as hospitals, schools 
and office buildings, etc., and decrease for small-occupancy structures 
such as warehouses, machine rooms, unmanned equipment buildings, etc. 
The determination of «, with due considerations to loss of human lives, 
needs actual earthquake life-loss statistics and a mathematical model 
which converts life-loss into dollars. These matters will need further 
studies and more data collection. Clearly, expressing é@ in terms of 
initial building investment is a convenient way of incorporating all 
possible losses in an earthquake environment. A sensitivity analysis 
for the parameter e should provide some insight to the overall cost 
structure. From eqs. (24) and (31c) 


é= Cw, KG, 
cA fo,K 0, 2 


(32) 


ll 


where 
bs 12 
i= 2d if,(i). 

It can be noted from the above that é is independent. of the design 
intensity 7, and the expected service life of a building. This is because 
according to its definition, @ is the expected loss associated with a 
“single” random event. The quantity é@ should not be confused with 
the total expected loss of building (see Appendix B for its derivation) 
throughout the entire service life, which should be expected to increase 
with service life but decrease with design intensity. 


2.7 Earthquake Risk Cost 


The expression for the earthquake loss function K,(7,), for the case 
of deterministic structural resistance and independent crossings of 
response process can now be established. Substituting eqs. (8), (32), and 
(10) [f;(2), p(2) and oa, given by eqs. (16), (18) and (28)] into eq. (6) 
leads to: 

2 12 12 
K.(i,.) = ated fo.K8,9(0( 2) Se 


le, i=1 i=io 


-[1 — exp (—2t.,e 7 7" )], t,=1,2,-°--,12 (33) 
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in which K and », are given by eq. (25) and eq. (30) respectively. To show 
eq. (38) is a monotonically decreasing function of 2, , let 


eled fe. 10,9(0( 2 y = 25g. 





12 2 
Br B xv 
> 2 exp (—&) = Q,, —= 6, 2 28 = ¢, 





C2 
and 
Kali.) _ ; 
20, ~ Z,(t,). 
Equation (33) is rewritten as 
12 
Z(t.) = >> ~ = Ts = T ;. ) | 
k=i9 
a = e"[L — exp (—ue™)], | a 
z2,=0, «k,a = 1, 2, 12 


where T;, = >oi2, & is a function of 7, and 71. = > 2, % is a constant. 
Notice in eq. (34) that for 6 and ¢ = 0, both exp (— 6k) and exp (—{/k’) 
are bounded between zero and unity; furthermore, for » 2 O, the 
quantities exp [— exp (—¢/k’)] and 1 — exp [—y exp (—{/k’)] are 
also bounded between zero and one. It is apparent that since 0 S 2, S 1 
and T; = > j., & = 0, therefore Z, is a monotonically decreasing 
function of 7, , as is expected. 

Substituting eqs. (3) and (34) into eq. (1), the final expression for the 


objective function is obtained: 
Ki.) = K.(,) ca Kitt.) 
— A,(f + Cl.) + 0,2.Z,(2,) 


in which the first term in the right-hand side increases with 7, and the 
second term decreases with 7, . The optimum intensity 7* is determined 
by setting equal to zero the first derivarive of eq. (35) with respect to 
design intensity 7, , i.c., 


(35) 


OZ (t) _ «== Ase 
a, 42 (36) 





Alternatively, 7% can be obtained from eq. (35) by direct, numerical 
evaluation of the function K(z,) for all 2, 

As an illustration of the presented spproadl a hypothetical building 
design problem is numerically analyzed for its optimal design earth- 
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quake intensity. Figure 2 is a plot of 7, versus the normalized total cost 
K(t,)/A,;f computed using the following data: The cost data, « = 
1 percent, g(t) = 5 percent; the building data, « = deflection = 0.9 X 
10°* ft; the earthquake data, 1 = 50 miles, a = 3 earthquakes per 
year, 8 = 2,t, = 25s,t = 40 yrs, c, = 8.16, co = 1.45, c; = 2.46, m, = 5, 
w, = 31.4 rad/s, &, = 0.5, w, = 3.14 rad/s, &, = 0.5, and d = 40 mi, 
and G = 1.6 g’s °. Although these numerical values are used for 
illustration purposes, nevertheless, the earthquake parameters reflect 
realistic data based on seismicity in Southern California. Four different 
seismic protective ratios c/f = 0.01 to 0.04 are considered. The results 
indicate that for this specific design problem, the optimal design in- 
tensity for the building is VII. 


NORMALIZED COSTS 





I Ir It Iv Avs VW Vit Vur It x XI XI 
DESIGN EARTHQUAKE INTENSITY, ig 


Fig. 2—Optimum design earthquake intensity analysis for a sample building. 
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A preliminary sensitivity analysis on this numerical example indicates 
that the convexity of the total cost function becomes more apparent as 
the building resistance parameter x increases (in this case building 
deflection measured in feet). For the problem under consideration, the 
value of x above which the design intensity could be established is 
found to be 0.7 X 10°” feet. For smaller values of z, the cost ratio 
K(i,.)/A,;f becomes insensitive to design intensity 7, and ratio c/f as 
the total cost curve becomes flat for 7, > V. 


III. CONCLUSION 


A simple mathematical approach is presented to determine the 
optimal design intensity of earthquakes for structures. The objective 
function to be optimized is taken as the total construction cost of the 
structure plus the expected cost of earthquake damage throughout 
the entire service life of the structure. For the case of deterministic 
structural resistance and probabilistic structural response with Poisson 
(independent) crossings, the objective function is derived in terms of 
the building and earthquake variables. The optimal design intensity 
can then be determined by minimizing the objective function with 
respect to the intensity variable. Other optimum design variables can 
also be obtained by simply regarding them as the decision variables 
in the objective function and by performing optimization analysis. 
The resulting equations are relatively simple and can be easily handled 
for numerical studies and sensitivity analysis. Generalizations of the 
results for nondeterministic structural resistance and for response 
processes different from those having Poisson crossings are also in- 
dicated. 
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APPENDIX A 


Expressions for p(t) under Different Assumptions 


(tz) Independent Peaks—The dispersion in the number of peaks 
of the narrowband response y(t) is neglected and the magnitudes of 
these peaks are assumed to be statistically independent variables, 
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and each having the probability distribution P[| y |e. < z] = 1 —- 
(v,/v.), then’ 


p(t) = 1 — exp [—2»,t, n (1 — e°*'”"’)). (37) 


(it) Independent Envelope Crossings—The crossings of the envelope 
of y(t) are assumed to be independent and in this situation’ 


p(i) = 1 — exp [—2(2&.)*(#/o,)t, exp (—a"/20%)] for & <K1 (88) 


in which &, equals the damping ratio of the structure. 

(17) Two-State Markov-Process Assumption'*—The successive in- 
tervals that the envelope of y(t) spends above and below the level x 
are assumed to be random variables with exponential distributions. 
In this case 


p(t) = 1 — exp [—(1 — »,/»,) exp (— 6.t.)] (39) 
where 
6, = n(1 — v,/v,)* 
and n, = J% p(r, r)7 dr |,-, is the envelope crossing rate of y(t). 
APPENDIX B 


Expressions for Failure Probability, p(t), and Total Expected Loss of 
Building, D 

Let h(t) be the expected loss in case of failure and p(t) be the failure 
probability density function for the building, then 


De / ” apd) dt. (40) 


where tp is the service life of the building. From the logic leading to 
eq. (32), and assuming a discount factor of cost g(f) = 1 — (t/tp), the 
function h(t) can be written as 


ht) = Ki.)(n + 9g) (41) 


where 7 is percent of construction cost, representing the earthquake 
loss. The failure density p(t) = dP(é)/dt, where P(t) is the failure 
probability of the building and is given by 


12 


PO) =1- YL prob NO = nla — pO) 2) 


t=to 
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where p(z) is given by eqs. (18) and (21), and N;(é) equals the total 
(random) number of earthquakes of intensity 7 in ¢ years. According to 
eq. (7) 


Probie a= I exp (— AF at)(AF at)" (43) 


where AF; is the probability that given an earthquake occurs, this 
earthquake has an intensity equal to 7. For a linear source of earth- 
quakes it follows from eq. (11) that 


AF; = TJ exp (—8)) — exp (=2)). (44) 


Equations (40) through (44) completely define the total loss expectation, 
D, and from above, it is obvious that D increases with tp and decreases 
with 2, . 
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